Bk using HYP-smeared staggered fermions in Nf = 2 + 1 unquenched QCD 
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or domain- wall sea quarks. 
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I. INTRODUCTION 



where 



CP violation was first observed, long ago, in neutral 
kaon mixing. The long-lived eigenstate Kl has a small 
CP-even impurity, allowing it to decay into two pions. 
The amount of this impurity is parametrized by s, whose 
experimental value is 



s = (2.228 ±0.011) x 10" 



(1) 



In the standard model, CP violation is induced by box 
diagrams involving virtual W and Z bosons, which lead 
to the prediction 



e = exp(i0 £ ) v / 2sin(0 £ ) C e lm\ t X B K + £, (2) 
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For the values of the parameters, see, e.g., Ref. 0. The 
key point is that, aside from known kinematic factors and 
QCD Wilson coefficients, e is given in terms of elements 
of the CKM matrix (occurring in the Xi) and the hadronic 
matrix element Bk- This matrix element has the form 



B K = C(fi)B K (!u) 

E !/ <^o|[57,(l-75)rf][s74l-75)d]|^o) 



BM = 



\(K \s l0l5 d\0)(0\s l0 ~f 5 d\K ) 



(4) 
(5) 



where C(fi) is the Wilson coefficient which makes Bk RG 
(renormalization group) invariant (as discussed further in 
Sec. lIII B|) . /i is the renormalization scale of the operator, 
and we have assumed that the kaons are at rest. Thus 
if one can determine the value of Bk, one can constrain 
the CP-violating part of the CKM matrix. 

It has been a long-standing goal of lattice QCD to cal- 
culate Bk- The challenging part of the required ratio 
of matrix elements, Eq. is the numerator. This is, 
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at first sight, relatively straightforward to evaluate, be- 
cause it involves only a single, stable particle in both 
initial and final states. A complication arises, however, 
from the left-handed structure of the four-fermion op- 
erator. This constrains its behavior in and around the 
chiral limit. In particular, at leading order in chiral per- 
turbation theory, the matrix element in the numerator is 
proportional to rn K . This vanishing is not guaranteed if 
one uses a formulation of lattice fermions in which chiral 
symmetry is broken. Because of this, most recent lattice 
calculations of Bk have used fermions in which one has 
at least some remnant of chiral symmetry — domain- wall, 
staggered, overlap and twisted-mass fermions. 

After a long history of quenched calculations, advances 
in algorithms, methodology and computers have allowed 
calculations of Bk using light dynamical quarks. Partic- 
ularly noteworthy are two "2 + 1 flavor" calculations in 
which all three of the physical light quarks are dynam- 
ical, although the up and down quarks are kept degen- 
erate!]] These calculations both use valence domain- wall 
fermions, with one using asqtad staggered sea quarks Q, 
and the other domain- wall sea quarks [1, H[ . Both cal- 
culations have produced results in which all sources of 
systematic error are controlled, and the total error is less 
than 5%. In particular, both calculations use two lattice 
spacings to allow a continuum extrapolation. The results 
of these two calculations are consistent. 

It is important for a parameter such as Bk , which is 
used to constrain the standard model, to have multiple 
calculations using different lattice methods. In partic- 
ular, we think it is important to use different valence 
fermions. Thus we have undertaken a calculation using 
valence staggered fermions. In particular, we use HYP- 
smeared improved staggered fermions, since these are 
known to reduce taste-breaking, and are computation- 
ally inexpensive to implement. We use the same asqtad 
lattices as in Ref. @, except that we add a third, smaller 
lattice spacing (a w 0.12, 0.09 and 0.06 fm). An impor- 
tant feature of our approach is that the four-fermion op- 
erators are constructed using HYP-smeared (rather than 
thin) links. This is known to reduce the size of pertur- 
bative corrections [1] and scaling violations Q- 

There has been one previous calculation of Bk using 
valence staggered fermions on the asqtad, Nj = 2 + 1 
lattices @ . This work differs from ours in that they used 
thin links to construct the operators, and only worked 
on the a w 0.12 fm lattices. As a consequence, the result 
contains potentially large discretization and matching er- 
rors. 

A drawback of using staggered fermions is that each 
flavor comes with 4 tastes, and one needs to use to the 
rooting prescription to reduce the number of sea quarks 
to the physical complement. Rooting leads to unphysical 
effects at non-zero lattice spacing. We assume, following 



1 We also compare to results from calculations with two degenerate 
dynamical quarks in the concluding section. 



the arguments of Refs. that the effects of root- 

ing vanish in the continuum limit, and are taken into 
account at non-zero lattice spacing by our use of ex- 
trapolations based on staggered chiral perturbation the- 
ory (SChPT). The phenomenological successes of rooted 
staggered fermions 13] provide support to this assump- 
tion. 

This paper is organized as follows. In Section m we 
derive the necessary next-to-leading order (NLO) SChPT 
results, first for SU(3) ChPT including the effects of our 
use of a mixed action, and then for SU(2) ChPT. In Sec- 
tion lllll we explain our methodology, describe how we ob- 
tain results for Bk, and then present our fits using both 
SU(3) and SU(2) SChPT. In the SU(3) case we must use 
constrained, Bayesian fitting, given the complexity of the 
functional form. We also discuss the light sea-quark mass 
dependence and the continuum extrapolation. We close 
the section by presenting final results and error budgets 
for both SU(3) and SU(2) analyses. We conclude in Sec- 
tion IIVI with a comparison of our result with previous 
work and some comments on our future plans. In four 
appendices we present, respectively, the functional forms 
needed for SU(3) fitting, the functional forms needed for 
SU(2) fitting, a derivation of several key results needed to 
obtain the SU(2) functional forms, and tables of a subset 
of our numerical results. 

Preliminary results from this project have been pre- 
sented in Refs. [IM^. 



II. STAGGERED CHIRAL PERTURBATION 
THEORY 

To extrapolate our data to the physical (average) light 
quark mass, and to the continuum limit, we use staggered 
chiral perturbation theory (SChPT). We have carried out 
fits using the functions predicted by both SU(3) (strange 
quark treated as light) and SU(2) (strange quark treated 
as "heavy") SChPT. The purpose of this section, and 
the associated appendices, is to derive these functions 
and present them in an explicit form suitable for fitting. 

For SU(3), a next-to-leading order(NLO) SChPT anal- 
ysis of Bk has been given in Ref. [lcf . That work, how- 
ever, considered the theory in which the sea and valence 
quarks had the same action. In our set-up, by contrast, 
we have asqtad sea quarks (since we use configurations 
generated by the MILC collaboration [l!|) and HYP- 
smeared valence quarks (as described in the following 
section) . Thus we must generalize the results of Ref. [18[ 
to a mixed-action theory. We also present the results in 
an explicit notation rather than using the compact but 
dense notation of Ref. [l8[ . 

The use of SU(2) ChPT has been advocated in 
Ref. [20|, and the continuum partially quenched NLO 
predictions for a variety of quantities, including Bk , have 
been presented. These results are directly applicable to 
domain-wall fermions, and have been used for extrapo- 
lating Bk in Ref. [2l|. Here we generalize the results to 
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staggered ChPT, finding that the complications of the 
SU(3) NLO SChPT form are greatly reduced. 

Before presenting the derivations we describe our nota- 
tion for the quantities that appear in the SChPT expres- 
sions, and explain how we obtain the numerical values 
for these quantities. We work throughout in the isospin- 
symmetric limit, and label the up and down sea quarks 
collectively as I. Thus m; is the common light sea-quark 
mass, while m s is the mass of the strange sea quark. 
Following Ref. ;18|, valence down (anti-)quarks are la- 
beled x while valence strange (anti-)quarks are labeled y. 
These have masses m x and m v , respectively. As noted 
above, the sea quarks use the asqtad action, while the 
valence quarks use the HYP-smeared staggered action 
(more precisely the HYP (II) choice in the convention of 
Refs. [1,[I3)- 

Many pseudo-Goldstone bosons (PGB) appear in the 
NLO chiral expressions. The mass of the "kaon" com- 
posed of ir-quarks and y-antiquarks (or vice- versa) is de- 
noted m xy: B, where B is the taste. The taste can be 
I, P, V, A or T, standing, respectively for taste scalar 
[1], pseudo-scalar [£ 5 ], vector axial [£ m5 ] and ten- 
sor [£ M „]. Here we use the result that at leading order 
(LO) in SChPT there is an accidental SO (4) taste sym- 
metry [23|. Similarly, the masses of "pions" composed 
of x-quarks and x-antiquarks are labeled m xx: B, while 
those of PGBs composed of y-quarks and antiquarks are 
denoted m yy -B- In the following, PGB masses are always 
in physical units, converted from the measured lattice 
masses using the scales determined by the MILC collab- 
oration [HIHII ■ These scales are given in Table ITVl b elow . 

To simplify the chiral expressions, we also use the fol- 
lowing shorthand notations: 

• The squared masses of the valence kaons are de- 
noted by Kb = , m 2 xy . B . We also use a special short- 
hand G = Kp for the Goldstone-taste kaon. This 
is the particle which we use for the external states 
in our calculation. 

• The squared masses of the valence pions are de- 
noted: Y B = m xx -B- Similarly, we use Yb — m yy . B . 
Note that for tastes B = I, V and A the pion masses 
have LO contributions from quark-disconnected di- 
agrams involving "hairpin" vertices. These are, by 
definition, not included in Xb and Yb- In other 
words, these mass-squareds are those of the pions 
in which the quark and antiquark are taken (im- 
plicitly) to have different flavors. 

• We also need the squared masses of analogous flavor 
non-singlet pions composed of 11 and ss quarks. We 
call these Lb = m-u-B an d Sb = "m 2 s . B , respectively, 
for B = I, V and A. 

• Finally, we need the squared masses of the t]b and 
i]' B pions for B = V, A and I. We call these masses 
simply t]b and r/' B . The t/b and r] B are the flavor 
singlet, sea-quark mesons that result from the in- 



clusion of mixing between 11 and ss pions due to 
hairpin diagrams [25j . 

We now address how we obtain the numerical values 
of the various masses discussed above. The valence kaon 
and non-singlet pion masses Kb, Xb, and Yb are ob- 
tained from our simulations in an ancillary calculation. 
In fact, we find to an extremely good approximation that, 
for our range of valence quark masses, the expected LO 
forms holdjj 

K B = h(m x + m y ) + A B (6a) 
Y B = b 1 (2m x ) + A B (6b) 
Y B = 6 1 (2m y ) + A B . (6c) 

In practice we do a fit to these equations and extract &i 
and the Ab, and then use the latter to reconstruct the 
mass-squareds. More details on these fits are given below 
in Sec. iLTTAl 



TABLE I. Masses of flavor non-singlet, taste P sea-quark 
masses, in lattice units [24j. Ensemble labels are from 
Table ITTTI below. 



Ensemble 


a\J Lp 


oV Sp 


CI 


0.3779 


0.4875 


C2 


0.3113 


0.4889 


C3 


0.2245 


0.4944 


C4 


0.1889 


0.4923 


C5 


0.1597 


0.4913 


Fl 


0.1479 


0.3273 


F2 


0.1046 


0.3273 


SI 


0.09371 


0.2075 



TABLE II. Taste splittings, r\ Ab, for flavor non-singlet pions 
composed of sea quarks [24]]. These are for the ensembles C3, 
Fl and SI (see Table ITTTI below). 



taste 


coarse 


fine 


superfine 


r{A A 


0.2053 


0.0706 


0.0253 


r\A T 


0.3269 


0.1154 


0.0413 


rfAy 


0.4391 


0.1524 


0.0552 


rlAs 


0.5370 


0.2062 


0.0676 



In the case of pions composed of sea quarks, we obtain 
the flavor non-singlet masses Lb and 5" B from MILC col- 
laboration results [l9|, [24j . The sea pion and "ss" meson 
masses are quoted in Table HI while taste splittings are 
collected in Table |TTJ We do not include the errors on 
these splittings in our analysis as their impact is much 
smaller than other sources of error. This is because the 



2 SChPT predicts that Ap = 0, although in practice the missing 
NLO terms together with finite- volume effects can lead to a small 
non-zero value for Ap. 



errors are very small, and furthermore enter into a nu- 
merically small part of the one-loop correction. We use 
the same taste-splittings for all the coarse ensembles, and 
similarly for both the fine ensembles, which we expect to 
be a very good approximation based on our results for 
valence pions. 

For the flavor singlets sea-quark mesons, the taste- 
singlet is simplest, since the rj[ (which corresponds to 
the J]' of QCD) is not a PGB and can be integrated out. 
This leads to the following result for the 771 mass: [25[: 



m = -L l + - Sl . 

For tastes B=V and A, we use the result of Ref. [2f 



77B = m^. B = i ( L B + S b + ^ra 2 S /S B - Z B 



Vb = ™ v .b = - [L B + S B + -a S : 



Hi 



1 



9 
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H B = Sb — Lb ■ 



(7) 



(8) 



(9) 



(10) 
(11) 



The superscript on the hairpin vertices a 2 (5'^v will be 
explained in the following section. Their numerical val- 
ues are taken from the fits of Ref. [l^] . Note that their 
dimensions are given by 8' B oc Aq CD . 

The taste violating parameters on the coarse lattices 
are M 



r\a 2 b~' A = -0.30(1)(3) 
r\a 2 5' v = -0.05(2)(3). 



(12) 
(13) 



These come from fits using the "smoothed" r*i values that 
we adopt, and that are given in Table HVl below. As for 
the sea-quark taste splittings, we fix these parameters 
to their central values and ignore the errors. For the 
other lattice spacings we scale the parameters assuming 
an a 2 a 2 s dependence. 



A. Mixed action effects in SU(3) SChPT 

The inclusion of mixed-action effects into SChPT can 
be accomplished by a straightforward generalization of 
the methods worked out in Refs. [26| - [29j . A mixed action 
can be described field-theoretically at the lattice level in 
the same way as a partially quenched (PQ) theory: there 
are valence quarks, corresponding ghosts, and sea quarks. 
The difference is that there is no longer a symmetry in- 
terchanging valence and sea quarks. This symmetry only 
emerges in the continuum limit, where the choice of dis- 
cretization does not matter. What this means is that one 
has a standard PQ set up when a — > 0, but, for a ^ 0, 
the corrections introduced by discretization errors in the 
valence, sea and mixed valence-sea sectors [parametrized 
by low-energy coefficients (LECs)] are unrelated. At first 
sight (and in light of the complexity of the analysis of Bk 



(a) 





FIG. 1. Classes of quark-line diagrams contributing to Bk 
at 1-loop order. The four-quark operator is represented as 
two bilinears which change a valence d-quark into a valence 
s-quark. These bilinears are shown as either (blue) squares 
(if the bilinear connects to both quark and antiquark from an 
external kaon) or as (red) hexagons (if the bilinear connects a 
quark from one kaon to an antiquark from the other). For each 
of the three diagrams there are others (not shown) related 
by interchanging the roles of quark and antiquark. In (c) 
the "hairpin vertex" contains, implicitly, a sum of "bubbles" 
composed of sea quarks. 



in an unmixed context [18l|). the required generalization 
appears rather daunting. It turns out, however, that this 
is not the case for our particular application, since nearly 
all of the new LECs do not enter at NLO in Bk- 

The easiest way to see this is to look at the quark-flow 
diagrams which contribute. These have been presented 
in Ref. jl8} , and we display them in Fig. [T] Only valence- 
quark lines occur in Figs.[TJa) and (b), so that the PGBs 
entering in loop diagrams are all of the "valence- valence" 
type. The form of the contributions from these diagrams 
will thus be unchanged by the use of a mixed action, 
and we can simply use the results of Ref. [18j. The 
contributions of these diagrams give what is called the 
"connected" part of the matrix element, A^ C onn, where 
connected means that the loops do not involve hairpin 
vertices. We present a fully explicit form for A^conn in 
the next section. We stress that the values of the LECs in 
A'fconn are altered by the change from unmixed to mixed 
action, but this has no practical consequences as these are 
a priori unknown parameters that are either determined 
from the spectrum or by the fits to Bk itself. 

Another feature of the quark-line diagrams is that 
no "valence-sea" mesons appear at NLO. Thus the new 
LECs associated with such mixed mesons do not enter 
into our result. 

New LECs do appear in the "disconnected" matrix el- 
ement, A^disc, for which the quark-line diagram is shown 
in Fig. QJc). The hairpin vertices in the loop implic- 
itly contain a sum over any number of insertions of sea- 
quark loops, corresponding to the propagation of "sea- 
sea" PGBs. This means that three types of hairpin vertex 
enter, instead of the single type for an unmixed action: 
a 2 S' B ', for the hairpin connecting valence- valence pions 
to themselves; a 2 8' B , for the hairpin connecting valence- 
valence pions to sea-sea pions, and a 2 5' B , for the hairpin 
connecting sea-sea pions to themselves. The last of these 
has already appeared in Eqs. (|51 TTU1) . These hairpins oc- 
cur as LO vertices only for tastes B=V and A, so that, for 
Bk, the change to a mixed action introduces only four 
new LECs: d'y , 8'™ , S'y and 5'™ . We stress that these 
hairpin vertices are independent of the quark masses. 
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Hairpins also occur at LO in the taste singlet channel, 
but here the dominant contribution is that which leads 
to the bulk of the physical rf mass. This is a contin- 
uum contribution, which is therefore independent of the 
fermion action, and thus is common to vv, vs and ss ver- 
tices. Small differences between these vertices of 0(a 2 ) 
do not contribute to Bk until next-to-next-to-leading or- 
der (NNLO). Thus the three types of vertices are the 
same at LO in the taste-singlet channel, and the result 
of Ref. [H} for this channel is unchanged. 

In light of the preceding discussion, we see that the in- 
gredients that change when moving from a PQ (unmixed 
action) theory to a mixed-action (MA) set-up are the 
hairpin propagators, D®f Q (q), D®f Q (q), and D^ PQ (g), 
where B=V or A labels the taste. We first recall the 
form of these propagators in a theory with an unmixed 
action [18]. The "xy" hairpin propagator is 



1 



(q 2 



(q 2 +X B )(q 2 
L B )(q 2 + S B ) 



■RB(q), (14) 



(q 2 + m)(q 2 + v B ) 



(15) 



[this is Eq. (42) of Ref. [l8| restricted to the case of 2 + 1 
flavors] . The expressions for rj B and r]' B are those given in 
([8]) and J9]) above except that 5' B is replaced by the sin- 
gle, common, hairpin vertex i5'b- The results for -D^j P< ^ 
are simply obtained by the replacement Y B — > Xb , while 
the opposite replacement yields the D^ p Q. Similar re- 
placements hold for the mixed-action theory, and so we 
discuss only D xy below. 

The result (IT4l can be understood as follows: a 2 5' B is 
the hairpin vertex, R,B(q) is the result of the sum over 
sea-sea mesons "within" the hairpin vertex, and the re- 
maining two propagators are those of the "external" xx 
and yy pions. One can then see how the result changes 
when the hairpin vertex splits into three types. If there 
are no sea-quark loops, one has the vv hairpin vertex 
and the external propagators, but no factor of _/?b(<7)- If 
there are sea-quark loops, then one has a common fac- 
tor of (S' B ) (from vv — > ss and ss — > vv), and the 
sum R B (q) except for the first term, with one factor of 
S' B removed to avoid double-counting. Thus one finds 
that D^ p Q(q) should be replaced with the mixed- action 
hairpin propagator 



nB,MA 



(q) 



1 



(q 2 + X v )(q 2 +Y v ) 

~vs\2 " 



Sb ss Sb / 



a 6 B + {a 5 B ) 



"•PQ(g)-0 a 



(q 2 + X B )(q 2 + Y B ) 



(16) 
(17) 



From the second line, we see that the impact of using a 
mixed action is twofold. 

1 . One must replace the overall factor multiplying the 
loop contribution calculated in the unmixed case as 
follows: 

S > B ^S^ = $ff-, B = V,A. (18) 
"b 

The form of the contribution will be otherwise un- 
changed, since the q dependence in the first term 
of (fV?]) is the same as that of L>^ PQ (<j). We also 
note that our lack of knowledge of the two <5{^ A1 
does not introduce new fit parameters since these 
hairpin vertices multiply unknown operator coeffi- 
cients from the chiral theory [as shown explicitly in 
Eq. (TAT6J)], 

2. In addition, one must evaluate a new loop integral, 
in which propagator has the same form as for a 
quenched hairpin, and in which the overall constant 
is 

E V B) , B = V,A. (19) 

S B 

The result is given as F B 2) in Appendix El This 
does introduce two new fit parameters. 



We close this section with a comment on the expected 
magnitudes of the four new LECs. The hairpin vertices of 
SChPT are the result of mapping four-quark operators in 
the Symanzik effective theory into the chiral theory. The 
structure of the underlying four-quark operators mirrors 
that of the chiral operators they produce. For example, 
the hairpin vertex proportional to S' B arises from an 
operator containing a valence-quark bilinear and a sea- 
quark bilinear, while that proportional to <5' B arises from 
an operator containing two valence-quark bilinears. The 
important observation is these four-quark operators also 
give non-hairpin taste-dependent contributions to the 
pion masses (viewed as having the gluons exchanged in 
the t-channel rather than the s-channel). The operators 
leading to S' B V , 5' B S and 5' B lead, respectively, to taste 
splittings for valence- valence, valence-sea and sea-sea pi- 
ons. But we know that the valence HYP-smeared action 
reduces taste splittings by a factor of « 3 compared to 
that for the sea-quark asqtad action [3(| • Assuming that 
this is a universal reduction, applicable to all four-quark 
operators, we conclude that S' B V /S' B ss 1/3. Since the 
improvement in taste-splittings arises from a reduction of 
the coupling of quarks to high-momentum gluons, we ex- 
pect that it "factorizes" , so that S' B S /8' B w l/\/3. Given 
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these assumptions, we are led to the expectations 

5™ A1 « S'b/3 and <^ IA2 « . (20) 

If these expectations are close to correct, then the taste V 
and A hairpin contributions are significantly reduced by 
the use of the mixed action: the "old" contributions from 
D^ p Q(g) are reduced by a factor of ~ 3, while the "new" 
5# A2 terms are suppressed. We use these expectations 
to simplify our fitting strategy. 

B. SU(3) SChPT result 

In this subsection we summarize the NLO result for 
B K in SU(3) SChPT— obtained from Ref. pj] with the 
generalization explained in the previous subsection. The 
utility of this result depends on the extent to which one 
can treat the strange quark mass as light compared to 
Aqcd- We will attempt to judge this a posteriori based 
on how successful we are in fitting with the SU(3) form. 

Discretization errors in Bk arise from the action and 
from the four-quark operator. If they involve taste- 
breaking then they are, for both HYP-smeared and asq- 
tad actions, of 0(a 2 a 2 ), while non taste-breaking errors 
are of 0(a 2 ) for HYP-smeared quarks and of 0{a 2 a) for 
asqtad quarks. The different factors of a are, however, 
misleading, since numerical results indicate that taste- 
breaking effects are enhanced. Furthermore, as noted 
above, different actions can lead to significantly differ- 
ent numerical sizes of taste-breaking effects, although 
formally these effects are of the same order. Thus, fol- 
lowing Ref. we adopt the phenomenologically-based 
power-counting scheme in which all discretization errors 
are treated as being of 0(p 2 ), irrespective of the num- 
ber of powers of a. In light of this choice, and for the 
sake of brevity, we refer below to all discretization errors 
as being simply of "0(a 2 )", except where the associated 
powers of a are important. 

Additional errors arise if one uses perturbatively cal- 
culated matching factors to relate the lattice operator 
to its continuum counterpart. We use one-loop match- 
ing, and furthermore keep only the subset of the one-loop 
induced operators involving bilinears with taste P. This 
results in "mixing errors" proportional to both a s — from 
one-loop mixing with operators with tastes other than 
P — and a 2 — from the unknown two-loop mixing with all 
operators, including those with taste P. As explained in 
Ref. [l8[, it is reasonable phenomenologically to treat 
both these effects as also of 0(p 2 ). The justification 
for treating 0(a) ~ 0(a 2 ) is that the one- loop coeffi- 
cients of the operators we drop are known to be small, 
while two-loop coefficients are unknown and might not 
be small. Indeed, with HYP-smeared fermions, we find 
that the relevant one-loop coefficients are of typical size 
a s /(47r) fa 0.08a s dHHl]], an< ^ * nus ex P ec t that their ef- 
fects are noticeably smaller than those of 0(a 2 s ). In light 
of this, we refer to truncation errors below as simply of 
0(a 2 ) unless we need to be more specific. 



Using this power-counting scheme, Ref. [18| deter- 
mined all the operators in SChPT that must be included 
at NLO, and carried out the required one- loop calcula- 
tion. This involves using LO propagators in the loops, 
which implies that the accidental SO (4) taste symme- 
try [IH Ha] holds for their masses. As already noted, we 
determine these masses from an ancillary spectrum cal- 
culation on the same configurations, or from published 
MILC results, so that they are not unknowns in the fits. 

The resulting expression for Bk can be written as 

16 

< (3) = £m^- (2D 

Here the coefficients bj are related to unknown SChPT 
LECs, while the Hj are known functions of the (numeri- 
cally determined) PGB masses. In the following we run 
through these 16 terms in turn, explaining their origin 
and their properties. The explicit expressions for the Hj 
are collected in Appendix [A"l 

The dominant contribution to Bk comes from the 
function 

#i = 1 + ^TWr M»» + M disc] • (22) 

This contains the LO contribution (the "I") together 
with the "continuum- like" loop contributions to A4 conn 
and A^disc- By "continuum- like" we mean that the loga- 
rithms have the continuum form (thus the superscript 0, 
indicating a — > 0), except that the pions in the loops have 
taste-splittings included in their masses. If these taste- 
splittings are set to zero, then M® onn and M^ isc . go over 
to the continuum chiral logarithms. Assuming that f n 
is known, the coefficient of the chiral logarithms is pre- 
dictedH Since we know the values of the required PGB 
masses, we know the value of the function Hi for each 
choice of valence and sea-quark masses. For this reason 
it makes sense to lump these terms together into a single 
function, H\. The explicit forms for M° onn and A4° isc 
are given in eqs. (|A! [) and (|A8I) . respectively. It turns 
out that A4" isc vanishes for degenerate valence quarks. 

The chiral logarithms in (|2"21 have the generic form 
m 2 K lnm^. Since an explicit 1/G factor is taken out, the 
functions A4° onn and A4 ^ isc have generic form m K In vtik- 

The coefficient of H\ is given by 

b x = B + 0(a 2 ,a 2 ). (23) 

Here Bg is the value of Bk in the combined SU(3) chiral 
and continuum limits. Bq is a quantity that is of inter- 
est for continuum ChPT. The remaining terms indicate 
the manner in which b\ approaches the continuum limit: 



3 As discussed below, our data is insufficient to determine f n from 
the fits. Thus we use a fixed value, which we vary among rea- 
sonable choices, as discussed below. 
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there are contributions from both non-taste breaking and 
taste-breaking discretization errors, and from the trun- 
cation of perturbation theoryQ When fitting at a single 
lattice spacing, one can treat b\ as a single constant. The 
scaling violations must be accounted for when extrapo- 
lating to the continuum limit. Alternatively one can at- 
tempt a combined continuum-chiral fit, as has been done 
by the MILC collaboration [19|. We have not done so 
here. 

The next four terms are the analytic NLO (and, in the 
case of H 3 , NNLO) corrections: 



are labeled by 



H 2 


= G/K 


(24) 


H 3 


= (G/A 2 x f 


(25) 


Hi 


(X P - Ypf 
GAl 


(26) 


H 5 


L P + Sp/2 


(27) 



All of these terms are present in the continuum limit, and 
to the order we work their coefficients are independent 
of a and the quark masses. We have chosen to write 
them in terms of the PGB masses rather than the quark 
masses. This allows us to set the scale of these terms by 
the chiral scale A x which we take to be 1 GeV in our 
fitting. Then, assuming naive dimensional analysis, we 
expect the coefficients of these terms bi — &4 to be of 0(1). 

We find that the NNLO analytic term, b^H^, is nec- 
essary to obtain reasonable fits. The need for a NNLO 
term is not surprising given relatively large kaon masses 
included in our SU(3) fits. It is, perhaps, surprising that 
we can get away with only a single term beyond NLO, 
given that the MILC collaboration requires NNNLO and 
NNNNLO terms in their fits to pion and kaon proper- 
ties pjj . The difference appears to be that the errors in 
Bk are larger than those in the quantities studied by the 
MILC collaboration — PGB masses and decay constants. 

We stress that we are not including the complete 
NNLO expression (which is not known). In particu- 
lar, we are not including non-analytic NNLO terms, and 
are keeping only one of the NNLO analytic terms. Our 
approach here is phenomenological — we are keeping the 
minimum number of NNLO analytic terms needed to ob- 
tain reasonable fits. 

The remaining 11 Hj are pure lattice artifacts, caused 
either by discretization errors or the truncation of match- 
ing factors, or both. They all arise from taste-violating 
interactions The first 7 are corrections to A4 conn , and 



4 In this case there are no terms linear in a [l8| . 

5 There are also taste-conserving discretization and truncation er- 
rors; these lead to the corrections in the expression for the coef- 
ficient bi, Eq. (f23l . 



H 6 
H 7 

H 8 

H10 
An 
H12 



(4) 
I 

(4) 
P 

(4) 
V 

(4) 
A 

(4) 



T 



(28) 
(29) 
(30) 

(31) 
(32) 
(33) 
(34) 



The explicit functional forms of F-^\ F^ 5 \ and are 
given in eqs. (1A~9|) . (|A~i~0]) and (|A"TTj) . respectively. The 
subscript on F^ indicates the taste of the pion in the 
loop. The functions F^ and F^ vanish in the limit of 
degenerate valence quarks. 

The chiral logarithms in the functions #6-12 are en- 
hanced over the continuum-like logarithms in H 1 . While 
the latter are of standard form m 2 K In tuk , the former are 
simply proportional to lnm/f- This enhancement of the 
chiral logarithm is balanced by the fact that the coeffi- 
cient, bj, is small: 



QCD 



« 2 ) 



(35) 



The final four functions Hj with j — 13 — 16 arise from 
corrections to .Mdisc due to discretization and truncation 
errors. These are the functions which are affected by our 
use of a mixed action, as explained above in Sec. Ill Al 
H13 and Hn are the loop integrals involving the hairpin 
propagator Z3^ p< *(g r ) [i.e. the first term in Eq. (|17)) ], 
They are, for B=V and A, respectively, 



#13 
Hi 4 



(1) 

V 

(1) 



(36) 
(37) 



and are multiplied by coefficients proportional to 6y A1 
and S^ A1 . This leaves the two functions multiplied by 
the two additional LECs introduced by our use of a mixed 
action, i.e. <5y A2 and S^ A2 . These are, respectively, 



Hie 



F 



(2) 



F 



v 

(2) 



(38) 
(39) 



The functions F^ and F^' are given in Ap- 
pendix (|A 3[) . They are yet more divergent in the chi- 
ral limit than f( 4 ~ 6 ) ; behaving generically as lnm^/m|- 
(although they vanish when m x = m y ). The power- 
counting is restored by coefficients [whose explicit expres- 
sions in terms of chiral coefficients are given in eqs. (|A16|) 
and (|A18[) ] which are proportional to the product of two 
factors arising from discretization or truncation errors: 



p(2) 



bi 



3-16 



Aqcd X 



(a 2 A 2 



QCD 



or a 



a 2 5™ A2 ) 



(40) 
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(We recall that 5j^ A1 ' 2 oc A 



4 

QCD- 



As discussed at the 



end of Sec. Ill A[ we expect Sq A1 to be suppressed by a 
factor of 3 from its natural size, and for S^ A2 to be close 
to zero. Thus we expect the contributions from H13 and 
H14 to be small, while those from H 15 and Hi 6 to be 
negligible. 



C. SU(2) SChPT result 

In the recent literature there has been much discussion 
of the convergence and reliability of SU(3) ChPT when 
extrapolating lattice results to the physical kaon mass. 
As examples, we contrast the apparently successful ex- 
trapolations of Refs. 0, EH EH (the latter being for Bk 
using a domain- wall valence/ staggered sea mixed action), 
with the lack of convergence found in Ref . |jj32[ . Recent 
reviews of the situation are given in Refs. [33l [34|. and 
a related discussion in continuum ChPT is in Ref. [35| . 
Our view is that whether SU(3) ChPT can be used reli- 
ably depends on the quantity considered (the rate of con- 
vergence is not universal) and on whether one includes 
NNLO and higher order terms (these are needed to repre- 
sent data in the region of the physical kaon mass) . Thus 
we have attempted SU(3) fits using the theoretical form 
described in the previous subsection (which includes one 
NNLO term), and hnd the reasonably successful results 
to be described below. 

It is nevertheless true that, for the non-degenerate lat- 
tice kaons which lie closest to the physical kaon, and 
which thus have the dominant effect on the extrapolation, 
the non-analytic dependence comes dominantly from chi- 
ral logarithms involving only the light quarks (x and £ in 
our notation). Chiral logarithms of mesons containing 
strange quarks (i.e. of Kb, Sb, Yb, Vi an d v'b^) can 
be expanded about their values when the strange quark 
mass equals its physical value and represented accurately 
by analytic terms. This means that, in effect, one is us- 
ing an (approximate) SU(2) ChPT expression for those 
kaons closest to the physical kaon even when nominally 
doing an SU(3) extrapolation. 

The RBC collaboration were led, by considerations 
along these lines, together with the poor convergence 
they found for SU(3) ChPT, to propose the use of SU(2) 
ChPT for the extrapolation of kaon (and pion) proper- 
ties [![. In this approach, first considered systematically 
in Ref. [36[ and extended in Refs. [1, [2l|, the kaon is 
treated as a heavy, static source, and no expansion is 
made in the mass of the (valence or sea) strange quark. 
The expansion in powers of (m n /A x ) 2 is supplemented 
by an expansion in (m^/mx) 2 - In our calculation, this 
means that the SU(2) ChPT result will apply only to the 



We do not include r\y or r\A in this list because the strange-quark 
component is small and these mesons are light. This can be seen 
from Eq. {5} , which in the limit that a?8L <g | Sb — L&\, which 
is the case in practice, gives r?e ~ Lb + a S' B . 



subset of our non-degenerate masses in which m x is small 
and m y is large. 

This methodology has been applied successfully to a 
calculation of Bk using domain-wall fermions by the 
RBC collaboration [2l|. Since they use lattice fermions 
with an almost exact chiral symmetry, the corresponding 
ChPT result is that of the continuum, except that the 
LECs can depend on a 2 . For our calculation, however, 
what is needed is an SU(2) staggered ChPT result. At 
first sight, this would appear to require a generalization of 
the rather involved enumeration of operators performed 
in Ref. [l8j to the case of SU(2) chiral symmetry with 
a heavy kaon source. It turns out, however, that a sim- 
pler approach suffices. One can show that it is sufficient 



to consider the m x ,ni£ <C m. 



limit of the next- 



to-leading order SU(3) SChPT result, provided that one 
allows the LECs to depend on m y and m s in an unknown 
(but analytic) way. In addition, one can show that the 
size of the chiral logarithms involving m x and mi relative 
to the LO term is not corrected by an unknown m s de- 
pendence. Thus the chiral logarithm remains a predicted 
correction, as in SU(3) SChPT. 

The validity of this "recipe" is demonstrated in Ap- 
pendix [C] The argument holds to all orders in an expan- 
sion in powers of m s /AQCD- 

Applying this recipe we find an important simplifi- 
cation: the NLO terms in SU(3) SChPT that involve 
taste-breaking LECs proportional to a 2 or a 2 get pushed 
to NNLO in SU(2) SChPT. This drastically reduces the 
number of parameters required in a fit to the NLO ex- 
pression. How this happens is explained in detail in Ap- 
pendix [B] 

The final SU(2) SChPT result for B K is 



Bk — ^ diQ4 



(41) 



where the functions that appear are 



Qi = 1 



32tt 2 / 2 



(Li - XiK(Xt) 



Xp 

A 2 

x 

v 



Q 2 



Bp_ 

A 2 

x 



(42) 

(43) 
(44) 
(45) 



The chiral-logarithmic functions I and I are defined in 
Eqs. (IA4|) and (|A5I) . and the factors t b , which are the 
fractional multiplicities of the different tastes, are given 
in Eq. (IA3|) . Note that we have kept a single analytic 
NNLO term, Q 3 , as for the SU(3) SChPT fits. As in the 
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SU(3) fits we fix A x = 1 GeV, although we could also use 
A x = m K iys . 

The constants di are arbitrary, unknown, analytic 
functions of m s and m y . In addition, at NLO d\ con- 
tains taste-conserving discretization and truncation er- 
rors (which it inherits from bi). Specifically, one has 

d 1 = B^ U{2) (m y ,m s ) + 0(a 2 , a 2 ). (46) 

where the first term is the value of Bk in the SU(2) 
chiral limit (m x — nig — 0) and with the given values of 
TYiy and m s . The 0(a 2 ,a 2 ) terms also have an implicit 
dependence on m y and m s . 

Clearly the SU(2) SChPT result is much simpler than 
that in SU(3) SChPT— at NLO, it has only 3 unknown 
coefficients at a fixed lattice spacing, to be compared 
to 16 for the SU(3) form. This gain is compensated in 
part by the fact that it can be used only for a small 
subset of our data, and because the fit parameters have 
an implicit dependence on m y and m s . Nevertheless, we 
find, as described below, that the SU(2) fitting is more 
straightforward and leads to smaller final errors in Bk- 



are given in Ref. [l9j]. We use three lattice spacings — 
a w 0.12, 0.09 and 0.06 fm — which are called coarse, 
fine and superfine, respectively. These lattice spacings 
are the nominal values — in our analysis we actually use 
the values obtained on each ensemble from the value 
of n/a obtained by the MILC collaboration [H, Hj. 
These are listed in Table IIVI To obtain a we set n = 
0.3108(15)(t^) fm, as given in Ref. [3 using to set 
the scale. 



TABLE IV. Values for r\/a and 1/a. See text for discussion. 



ID 


n/a 


1/a (GeV) 


CI 


2.650(4) 


1.682(3) 


C2 


2.644(3) 


1.679(2) 


C3 


2.618(3) 


1.662(2) 


C3-2 


2.618(3) 


1.662(2) 


C4 


2.635(3) 


1.673(2) 


C5 


2.647(3) 


1.681(2) 


Fl 


3.699(3) 


2.348(2) 


F2 


3.695(4) 


2.346(3) 


SI 


5.296(7) 


3.362(4) 



III. DATA ANALYSIS 

In this section we explain how we calculate Bk, how 
we convert the numerical data into physical observables 
using matching factors, and how we extrapolate to the 
physical kaon mass and the continuum limit using SxPT. 



TABLE V. Valence quark masses (in lattice units). 
a (fm) am x and am y 

0l2 0.005 x n with n = 1, 2, 3, . . . , 10 

0.09 0.003 x n with n = 1, 2, 3, . . . , 10 

0.06 0.0018 x n with n = 1, 2, 3, . . . , 10 



A. Computation of Bk 



TABLE III. MILC lattices used for the numerical study. Here, 
"ens" represents the number of gauge configurations, "meas" 
is the number of measurements per configuration, and ID will 
be used later to identify the corresponding MILC lattice. 



The valence quarks are HYP-smeared staggered 



a (fm) 


ami /arris 


size 


ens x meas 


ID 


0.12 


0.03/0.05 


20 3 


x 64 


564 x 1 


CI 


0.12 


0.02/0.05 


20 3 


x 64 


486 x 1 


C2 


0.12 


0.01/0.05 


20 3 


x 64 


671 x 9 


C3 


0.12 


0.01/0.05 


28 3 


x 64 


274 x 8 


C3-2 


0.12 


0.007/0.05 


20 3 


x 64 


651 x 10 


C4 


0.12 


0.005/0.05 


24 3 


x 64 


509 x 1 


C5 


0.09 


0.0062/0.031 


28 3 


x 96 


995 x 1 


Fl 


0.09 


0.0031/0.031 


40 3 


x 96 


678 x 1 


F2 


0.06 


0.0036/0.018 


48 3 


x 144 


744 x 2 


SI 



We use the MILC ensembles listed in Table [HT1 They 
are generated with Nf = 2+1 flavors of improved (asqtad 
action) staggered sea quarks, using the rooting trick to 
cancel the effects of the extra tastes. 

The values of the light sea-quark mass (am;) and 
strange sea-quark mass (am s ) are given in Table HTT1 De- 
tails of the lattice generation, decorrelation times, etc. 



fermions, i.e. the configurations are HYP-smeared 37 1 
(using the coefficients of the HYP (II) choice in the con- 
vention of Refs. 0, H2|) and then we use the unimproved 
staggered action. We have found this to be as effective 
at reducing taste-breaking as the more elaborate HISQ 
action [30]. We use 10 different values of the valence 
quark masses (i.e. m x and m y ) running from ~ m s /10 
to ~ m s . They are given in Table |Vl In total, we have 
55 mass combinations for our valence PGBs: 10 degener- 
ate "pions" (m x = m y ) and 45 non-degenerate "kaons" 

K # m v)- 

To give a more precise indication of the size of our 
valence-quark masses, we quote in Table WT\ the values of 
the valence "physical" down and strange quark masses. 
These are obtained on each ensemble as follows. The 
strange mass, m y , is tuned so that Yp = (0.6858 GeV) 2 , 
which is the continuum s — s mass keeping only quark- 
connected contractions, as determined by Ref. 38]. The 



down mass, 



is then tuned so that G = 



The tunings require extrapolation, and we use the linear 
extrapolation in quark masses discussed below. For the 
coarse ensemble, we quote only results for the C3 ensem- 
ble, since those on the other coarse ensembles are very 
similar. Comparing to Table [Vj we see that our heaviest 
quark mass is somewhat lighter than the physical strange 
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quark mass on all ensembles. 



TABLE VI. Physical down and strange valence-quark masses 
(in lattice units). 



Ensemble 


am d 


arris 


C3 


0.00212(2) 


0.05174(5) 


Fl 


0.00142(2) 


0.03523(5) 


SI 


0.00102(1) 


0.02362(3) 



To calculate the four-quark matrix elements needed for 
Bk we follow the methodology developed in Refs. (39i — 
l4l| . We place wall-sources composed of U(l)-noise at 
timeslices t\ and t 2 > ti, each of which are used to pro- 
duce both a valence quark and antiquark. Boundary con- 
ditions are periodic in all directions. The sources consist 
of random phases for each color index on each site on 
the timeslice, and have the property of selecting only the 
kaon having the desired taste P (i.e. with quantum num- 
bers 75 ® £5) and having zero spatial momentum. One 
source produces a valence kaon, the other an antikaon, 
and these are contracted together in a gauge-invariant 
four-quark operator at an intermediate time t. These 
operators involve quark and antiquark fields spread out 
over 2 4 hypercubes, and thus live on the two timeslices t 
and t + 1. They are made gauge invariant using products 
of HYP-smeared links along the shortest paths joining 
quarks and antiquarks, averaged over all such paths. The 
operators are then summed over spatial directions so as 
to improve the signal. 

For completeness, we give expressions for the operators 
that we use. The four-fermion operator is, at tree-level, 



O k = 



3=1,4 



Oi att (t) = £S>„ x P][V, x Pj^t) 
y f 

o i r(t) = ££[^ >< p]K x p]nm 

V M 

Ol aU (t) = X P][A, x P]j(y,t) 

y m 

OT\t) = £$> M x P][A, x P] n (y,t) , (51) 

(52) 



y m 



(47) 
(48) 
(49) 
(50) 



where y is a vector labeling spatial hypercubes (so that its 
components are all even). The operators are composed 
of hypercube-based bilinears, each denoted by [S x F], 
where S is the spin and F the taste. The subscripts / and 
II distinguish the two ways in which the color indices in 
the bilinears are connected by products of links. A com- 
plete definition of these operators is given in Ref . [22[ ■ At 
one-loop order, the coefficient multiplying each operator 
differs from unity, as shown in Eq. (|59[) below. 

The four-quark matrix element should be independent 
of t when t\ < t < t 2 , as long as the operator is far 



enough from each source to remove contamination from 
excited states. This is because we always use a kaon 
and antikaon with the same mass. To obtain Bk we 
then divide this matrix element by that obtained via the 
"vacuum saturation" approximation, as shown in Eq. [5] 
Explicitly, the ratio of correlation functions we use is 



B K (t) = 



2{W{t l )0 K {t)W{t 2 )) 



(8/3)(W(h)[A x P](t))([A x P](t)W(t 2 )) ' 

(53) 

where W represents a wall source, and the factor of 2 
in the numerator accounts contractions present in the 
continuum which are absent for our lattice operator [40| . 
This ratio should be also be independent of t away from 
the sources, and cancels the coupling of the wall sources 
to the (anti)kaons. Thus our aim is to choose the sources 
so that there is a clear plateau, and then fit Bk to a 
constant in this region. 



0.291 




0.287 



10 



20 



30 



FIG. 2. Pion effective mass versus time from wall-source. Pa- 
rameters are am x = am y = 0.025 on the C3 coarse ensemble. 
The fit is described in the text. 

To gauge how far apart to place the sources, we look 
at the two-point correlator from the wall-source (say at 
t = 0) to the taste-P axial current (i.e. with quantum 
numbers 7475 <8> £5) summed over the spatial time-slices 
at times t and t + 1. This is one of the two factors in the 
denominator of Bk (and also shows how we obtain the 
taste-P kaon mass). We show an example of the resulting 
effective mass in Fig. [2] This is defined by equating the 
correlator at times t and t + 1 to the form 

f(t) = Z[cxp(-m^{t + 0.5)) - exp(-m 7r (T - t - 0.5))] , 

which includes both a direct and a "wrap-around" expo- 
nential (with T the lattice extent in the time direction). 
The plot is for a pion on one of the coarse ensembles with 
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a mass close to that of the physical kaon. We observe a 
noticeable contamination from the excited states up to 
t ~ 10, and that the signal degrades for 26 < t. Also 
shown is a fit of the correlator to the form (f5~4"]) in the 
plateau region. 

From this and similar plots, we know that on the coarse 
lattices we need to place our four-fermion operator at 
least 10 timeslices from both sources, i.e. that At — 
ti — t\ > 20. Here we assume that the coupling of the 
four-quark operator to excited states is comparable to 
that of the axial current — a reasonable assumption given 
that the vacuum saturation approximation turns out to 
be good to within a factor of two. The source separation 
cannot be too large, however, because the signal will then 
be contaminated by wrap-around contributions arising 
from our use of a periodic lattice. Thus we have chosen 
At to be somewhat larger than the minimum, but not too 
much larger. Our specific choices are given in Table [vTll 
We have held the physical distance between the sources, 
At /a, roughly constant as the lattice spacing is reduced. 
The table also gives the parameters that determine the 
fitting range: we fit from t = t\ + <l to t = t\ + t^ ■ — 
t2 — ft — 1. Note that our fit ranges are symmetrical 
with respect to the two wall sources since the four-quark 
operator resides on timeslices t and t + 1 . 



TABLE VII. Choices for the wall-source separation, At, and 
its ratio to the temporal length of the lattices, T, as well as 
the parameters determining the fitting range. 



a (fm) 


At 


At/T 


t L 


t R 


0.12 


26 


0.41 


10 


15 


0.09 


40 


0.42 


13 


26 


0.06 


60 


0.42 


22 


37 



We can estimate the systematic error due to wrap- 
around contributions as follows. These contributions 
have the kaon from one source propagating the "wrong" 
way around the lattice, so that the matrix element which 
occurs is (KqKq\O\0) or its conjugate. At leading order 
in continuum ChPT, this matrix element has the same 
magnitude (and opposite sign) as the desired matrix ele- 
ment (Ko\0\Kq). Since the wrap-around effect is small, 
when estimating its size we ignore the differences between 
these two matrix elements due to higher order SChPT ef- 
fects. Then the contamination is simply given by the ra- 
tio of the kaon propagators in the two cases. Adding the 
two contributions involving a single (anti)kaon wrapping 
around the lattice, we find that the ratio of contamina- 
tion to signal is 

R K {t) = 2exp{- am K {T- At)} 

x cosh{2am K [(t - h) - {At - l)/2]} .(55) 

This is suppressed by the overall exponential factor, but 
grows on either side of its minimum (which occurs mid- 
way between the sources) as a cosh with an exponent 
containing 2am,K . To give an idea of the size of the ef- 
fect we give two examples of the value of Rk at the edge 



of the fitting range (where it is maximal) . On the coarse 
lattices, with T = 64, At = 26 and t-t\ = t L -t\ = 10 we 
find i?^(t) = 0.015 for the lightest pion (am xy: p — 0.1342 
for am x = am y — 0.005), while (i) = 1.2 x 10~ 4 for 
the lightest kaon used in the SU(2) chiral fit (am xy -p — 
0.2749 for am x = 0.005 and am y = 0.04). The latter 
result is most relevant for an estimate of the size of the 
induced error in our final result for the physical kaon 
(which has a mass am v ^ ys « 0.3 on the coarse lattices). 
This is because, whether we use the SU(3) or SU(2) fits, 
the most important points are those closest to the place 
to which we are extrapolating. Thus we conclude that the 
wrap-around systematic is negligible for Bk , although it 
would be significant (at the few percent level) were we to 
quote a result for Bq (Bk in the chiral and continuum 
limits) . 

Based on the autocorrelation lengths for light hadron 
properties, the lattices in the MILC ensembles are some- 
what correlated [19(. Because of this, we attempt to re- 
duce correlations between our measurements of Bk by 
randomly choosing the overall source position, t\ , on each 
configuration (with t^ fixed to t\ + At). For three ensem- 
bles (C3, C3-2 and C4) we have increased our statistics by 
doing multiple measurements on each lattice (as shown 
in Table . We chose the t\ values for these measure- 
ments randomly and also set the random number seeds 
for the U(l) noise sources randomly 

Using these enlarged data-sets, we have done a prelim- 
inary study of autocorrelations by seeing how the error 
changes when we bin configurations together. This has 
been done for a pion two-point correlator, for which the 
errors are smaller than those for Bk- The results for en- 
semble C3 are presented in Ref. [16J. We find that the 
errors increase somewhat as the bin size increases, reach- 
ing a plateau for bins of size 3 or above, in which the 
errors are about 20% larger than those obtained treating 
each configuration as independent. This increase is only 
visible after we use the larger sample (9 measurements 
per configuration). 

We take this result as indicating that autocorrelations 
are a small effect. In particular, because our statistical 
errors turn out to be smaller than the systematic errors, 
a 20% increase in the former would have little impact 
on our final result. Thus we have ignored the possible 
impact of autocorrelations in this paper. 

Typical results for one- loop renormalized Bk, along 
with the fitting range and resulting value and error, are 
shown in Fig.[3J This figure shows the dependence on the 
lattice spacing when we use a kaon composed of degener- 
ate quarks but with a mass close to that of the physical 
kaon. The data have a similar profile for all three lat- 
tice spacings, showing contamination from excited states 
near the boundary but a reasonable plateau that extends 
further out than our fit region. The smaller size of the 
errors on the coarse lattice reflects the larger sample (9 
measurements on each lattice). 

The effect of varying the quark masses (at fixed lattice 
spacing) is illustrated by Fig. 2J The (blue) diamonds 
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FIG. 3. B K (n = l/a) as a function of T = t - t x . (Blue) 
diamonds are from the coarse ensemble C3, with am x — 
am y = 0.025; (red) crosses are from the fine ensemble Fl, 
with am x = am y — 0.015; and (brown) octagons are from 
the superfine ensemble SI, with am x — am y = 0.009. The 
result of a fit to a constant over the range given in Table IIIII 
is shown by the horizontal lines. Note that the T is given in 
fermis. 



show the same data as in Fig. [3l with degenerate quarks 
and an approximately physical kaon mass. The (red) 
crosses show the change when m y — m x is maximized (in 
our data set) with m x + m y fixed (so that the kaon mass 
is almost the same). One sees that the central value 
increases a little, and that the errors are significantly 
larger. The (brown) octagons shows what happens if 
both m x and m y are reduced to our smallest value, so 
that the resulting "kaon" is closer to a pion (having a 
mass « 220 MeV) . Here the central value is significantly 
reduced, and the errors are much larger. Fortunately, our 
results from light "kaons" play little role in determining 
the extrapolation to the physical kaon. 

In these figures, and in all subsequent analysis, errors 
are obtained using the jackknife procedure. This is true 
both for the errors on individual points and for the errors 
in fit parameters. When doing fits to multiple timeslices, 
and when fitting to multiple quark masses (as required 
for the chiral fits discussed below) we do not use the (in- 
verse of the) full covariance matrix, which he have found 
to lead to instabilities, but rather keep only the diago- 
nal elements. This means that we cannot quantitatively 
judge the goodness of fit, since the x 2 is generically un- 
derestimated. Assuming, however, that the fits are rea- 
sonable, the jackknife errors should be reliable. 

We close this subsection by describing how we deter- 
mine the values of m xy -B needed as inputs to the SChPT 



FIG. 4. As for Fig. except that all data is from the C3 
ensemble. (Blue) diamonds show results for am x — 0.025 and 
am y — 0.025, (red) crosses for am x = 0.005 and am y — 0.045, 
and (brown) octagons for am x = 0.005 and am y = 0.005. 

fits to Bk- For the taste-P meson, we measure the mass 
directly using our wall sources. We have four sets of two- 
point correlation functions: 

Ci(t) = (Ai(«)P(ti)) , (56a) 

C 2 {t) = (A±(t)P(t 2 )) , (56b) 

C 3 (t) = (P(t)P(ii)) , (56c) 

C 4 (t) = (P(t)P(t 2 )) . (56d) 

We determine the Goldstone pion masses by fitting these 
four sets of data individually and averaging the results. 

We have calculated the masses of PGBs having all 
other tastes on most of the ensembles. The methodology 
is explained in Refs. [30|, EH- Specifically, we use 8 "cubic 
wall sources" per configuration, and "Golterman-style" 
sink operators. Results for the best-measured tastes are 
shown in Figs. [5] and |6] for a coarse and the fine ensem- 
ble, respectively. The nondegenerate points are also con- 
sistent within errors with the observed linear behavior, 
though we do not show them for the sake of clarity. The 
results for the other tastes have masses that are com- 
pletely consistent with the 5*0(4) symmetry expected at 
LO. Given this, we choose to assume exact 50(4) symme- 
try for the masses used in the ChPT expressions, taking 
the values from the best measured tastes (such as those 
in the Figures). 

The LO prediction of Eq. ^ has the slopes being the 
same for all tastes. NLO effects will, however, lead to a 
difference between the slopes. We find that the slopes do 
have statistically significant differences, but that these 
differences are very small, a few percent or less [Hj]. In 
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FIG. 5. m xy r\ versus (ra x +m y )ri, for states with m x — m y , 
and tastes (from bottom to top) £5, £i£s, £i£4, £4 and 1. ri 
is the modified Sommer-parameter, whose value is 2.619a. 
The fits are discussed in the text. Results are for the coarse 
ensemble C3. 
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FIG. 6. As in Fig. [5] but for the fine ensemble Fl. Here 
ri = 3.701a. 



PGB masses that we use in chiral loops using 

K B = G + A B - A P , (57a) 
X B = X P + A B - A P , (57b) 
Y B = Y P + A B - A P . (57c) 

This choice effectively assumes that the slopes are equal 
to that of the taste P mesons. This approach reproduces 
the actual masses to within a few percent or better, and 
is, in any case, theoretically consistent as the LO masses 
can be used in loops. Put differently, the inclusion of the 
small differences between slopes in the chiral logarithms 
would lead to an analytic contribution of NNLO, i.e. of 
higher order than our expressions. 

We proceed in the manner just described on all en- 
sembles except the large- volume coarse ensemble (C3-2), 
the fine ensemble (F2), and the superfine ensemble (SI). 
On these three ensembles we have not measured the full 
taste spectrum, due to the high computational cost. Wc 
estimate the taste splittings on these ensembles as fol- 
lows. For (C3-2) we simply use the taste splittings from 
the smaller lattices with the same quark masses [ensem- 
ble (C3)]. For (F2) we assume the same taste splittings 
as for the other fine ensemble, (Fl), based on the fact 
that taste splittings on the coarse lattices are almost in- 
dependent of the light sea-quark mass. Finally, for the 
superfine ensemble, we obtain the Ab by extrapolating 
from the coarse and fine lattices. These quantities are 
expected to vanish in the continuum limit as a 2 a 2 . We 
thus fit our results from the C3 and Fl ensembles to 

Ab(<x) = b\ a 2 a 2 ((i = 1/ct) (58) 

with a s determined using 4-loop evolution from the Par- 
ticle Data Group value at n = Mz- The result is satisfac- 
tory, as shown in Fig. and leads to the values collected 
in Table IVTlTl Since the taste splittings on the superfine 
lattice are so small, the uncertainty introduced by the 
need to do this extrapolation has a negligible impact on 
our chiral fitting. 

TABLE VIII. Results for valence taste-splittings, A B (GeV 2 ), 
on ensembles C3, Fl and SI. Those on C3 and Fl are ob- 
tained by calculating PGB masses using cubic wall sources 
(see Ref. [4^ ). Those on SI are obtained by extrapolating 
using Eq. ((551) . 



a (fm) 


0.12 (C3) 


0.09 (Fl) 


0.06 (SI) 


A A 


0.02905(59) 


0.00912(29) 


0.003487(60) 


A t 


0.0565(13) 


0.01753(49) 


0.00666(12) 


A v 


0.0819(18) 


0.02612(68) 


0.00977(16) 


A s 


0.1052(88) 


0.0314(33) 


0.01219(80) 



B. Matching factors 

practice we do linear fits allowing the slopes to differ 

(i.e. allowing b\ to depend on taste B), and from these To convert the results of our lattice calculation into 
determine the intercepts A^. We then reconstruct the physical observables, we need to match lattice and con- 
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FIG. 7. Ab from ensembles C3 and Fl (the rightmost two 
sets of points). These are extrapolated, as described in the 
text, to obtain the results for the SI ensemble (leftmost set 
of points). Numerical values are given in Table [*VTTT1 



tinuum operators. The latter we define in the conven- 
tional way using naive dimensional regularization with 
MS subtraction. For the matching, we use one-loop per- 
turbation theory, which leads to the general form 



or nt (M) = E 



4-7T 



Cij) 



of* (45.9) 



Here superscripts "cont" and "latt" indicate continuum 
and lattice operators, respectively, 7y is the anomalous 
dimension matrix, the c,j are finite constants, and fi is 
the renormalization scale for the continuum operator. We 
choose the coupling constant here and below to be that 
defined in the continuum M S scheme (which we evaluate 
numerically using four- loop running from m-z)- The scale 
at which it is evaluated, q* , is undetermined by one-loop 
matching. 

In our calculation we are interested in a single con- 
tinuum operator, so i takes one value, but the matching 
involves many lattice operators, labeled by j. The opera- 
tors which appear are listed in Ref . [22[ ■ The values of c,j 
for our HYP-smeared operators, using the appropriate 
Symanzik-improved gauge action, have been calculated 
by three of us and will be presented in Ref. [43| . 

When using this matching formula we do not, in fact, 
include all the lattice operators which appear at one-loop, 
but instead keep only those with external tastes P. There 
are four of these, and they are listed above in Eqs. (|48l 
ISTj) . Dropping the other operators leads to the 0(a/ir) 
truncation errors discussed earlier. 

Having matched to the continuum operator at an in- 
termediate scale fi, we then evolve using the continuum 



renormalization group (RG) down to a canonical scale 
p = 2 GeV. We do so using the two-loop formula 



B K {NBR,p) 
Z 



[1 



47T 



Z] 



47T 



Z] 



a{n)J 



(60) 



Wo Po 
7 (o) 

Wo' 



Expressions for the ft and 7W are given, for example, in 
Ref. 0]. In this way we obtain B K (NDR,p = 2 GeV). 

In order to implement this matching and running we 
have to choose values for fi and q* . The result depends 
on these choices because we have truncated the match- 
ing at one-loop and the running at two-loops. In both 
cases the error we make is of 0(a 2 ). Based on the argu- 
ments of Ref. [45| , we use "horizontal matching" in which 
q* = fi. The choice of q* should be made so as to min- 
imize higher-order terms in perturbation theory, which 
requires that q* be a typical scale of momenta flowing in 
loops. We simply take q* = l/a. It is possible to make 
a better-informed estimate using the one-loop integrands 
themselves, but we have not yet attempted this. 

Finally, we recall that one can also define the RG in- 
variant quantity Bk, 



1 



a(fi) 



BrW, (61) 



which is often used in the continuum literature. This 
quantity has the apparent advantage that all trunca- 
tion errors are manifestly proportional to a(fi) 2 , with the 
scale fi being naturally chosen of 0(1/ a) (and in our case, 
chosen to be exactly l/a). Thus these errors vanish (al- 
beit logarithmically) in the continuum limit. By compar- 
ison, using Bk (NDR, 2 GeV) apparently leads to trun- 
cation errors which have a nominal size of a (2 GeV) 2 , 
which does not vanish in the continuum limit. This is, 
however, misleading, in that if different lattice calcula- 
tions use the same truncated formula to run to 2 GeV, 
they are in effect comparing Bk, so the real truncation 
errors in the comparison are ~ a (l/a) 2 . 



C. SU(3) fitting: Strategy 

We would like to fit our results for Bk to the func- 
tional form of Eq. (f2~Tj). For a given ensemble (fixed a 
and sea-quark masses) the fit form has 15 parameters^ 



7 There are 15 rather than 16 parameters because H§ is a constant 
for fixed sea-quark masses, and so can be absorbed into 61 (the 
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Despite the fact that we have 10 degenerate and 45 non- 
degenerate data points on each ensemble, we find a direct 
fit to Eq. (|21[) to be very difficult. Here we explain why 
and describe our strategies for dealing with this problem. 

We begin by sorting the 15 terms into 4 categories, 
according to the following criteria: 

• Does the term survive in the continuum limit, 
i.e. does it correspond to a term in continuum 
ChPT? Contributions from discretization errors 
(taste-breaking or otherwise) and from truncation 
errors do not have such a correspondence. 

• Does the term contribute to Bx in the degenerate 
case of m x = m y , or does it vanish in that limit? 

The result of this categorization is shown in Table IIX1 

TABLE IX. Classification of contributions, Hi, to the NLO 
SU(3) SChPT expression for B K , Eq. pT). when fitting at a 
fixed a and sea-quark masses. Terms in the "cont" column 
have a corresponding continuum form, while those labeled 
"non-cont" do not. Terms in the row labeled "deg" contribute 
both for m x — m y and m x 7^ m y , while those in the "non- 
deg" row contribute only for m x 7^ m y . Numbers in the table 
give the subscript, i, of the fit form Hi. Numbers underlined 
indicate those Hi which are kept in our final SU(3) fits. 





cont 


non-cont 


deg 


I, 2,3 


6, 7, 8, 9, 10 


non-deg 


4 


11, 12, 13, 14, 15, 16 









From Table IIX1 we observe that the fitting functional 
is somewhat simplified if we fit only to degenerate data 
points. There are then three "continuum-like" terms, 
Hi_3, and five terms which are lattice artifacts, iJ 6 _ 10 . 
Since we have 10 degenerate data points, we can, in prin- 
ciple, fit them to a functional form composed of 8 terms. 
In practice, this turns out to be very difficult, primarily 
because the functional forms of the five lattice-artifact 
terms are very similar for our range of PGB masses. 
This is not unexpected: the five functions have the same 
form [that of F M defined in Eq. (|A"9l) ]. but differ in the 
taste of the PGB whose mass appears. Thus one expects 
the functions to be similar except at the lightest quark 
masses where the taste-splittings are comparable to the 
masses themselves (see Fig. [5]). This is borne out by di- 
rect numerical evaluation. The similarity of the functions 
becomes more pronounced as one approaches the contin- 
uum limit, because of the reduction in taste splittings. 
Indeed, on the superfine lattices, one can argue that the 
0(a 2 ) effects should be treated as of NLO, and thus that 



coefficient of Hi). Doing so leads to additional chiral logarithms 
multiplied by quark masses, but these are of NNLO, and thus 
irrelevant to our NLO expression. Of course, having absorbed 
65 into b\ , we must subsequently take into account the resulting 
sea-quark mass dependence of b\. This we do in Sec. IIII Fl 



the difference between these five functions is really an 
effect of NNLO. 

In light of these considerations, we approximate all five 
functions by i.e. the function in which the mass 

is that of the taste-T PGB. We choose the tensor taste 
as its mass is roughly the average of those of all tastes. 
We stress that this is not a systematic procedure, since 
if (D(m) ~ 0(a 2 ), as is the case for the smallest quark 
masses on the coarse lattices, it is not legitimate to ex- 
pand log(JTB) about Xb = Xt and treat the difference as 
of higher order. Our procedure is simply a phenomeno- 
logical way of resolving the presence of nearly flat direc- 
tions in the fit function. 

With this approximation, the fitting function for the 
degenerate data-set collapses to 

/tt S = E^ (62) 

i—l 

where F\ = Hi, F% = H2, F3 = H3 [with the Hi as 
defined in Eqs. §MM] and F 4 = H 10 = F^ . Fits to 
this 4-parameter form are stable, and we use them as the 
first stage of most of our fitting strategies. 

The fit to ([62]) can also be constrained given our prior 
knowledge of the size of the coefficients. We expect from 
continuum ChPT power-counting that c\ ~ C2 ~ C3 ~ 
O(l). For C4 = 610 we have two possible estimates, given 
in Eq. (|35|) . depending on whether discretization or trun- 
cation errors dominate. One can systematically include 
this prior information in the fitting using the Bayesian 
method [4(|, as discussed in the next subsection. 

We now turn to the remaining 45 non-degenerate data 
points. As can be seen from Table IIX| including these 
points in the fit requires the introduction of one addi- 
tional term with a non- vanishing continuum limit, and 
six terms arising from lattice artifacts. Once again, how- 
ever, directly fitting to all these terms is difficult because 
some of the functions are similar. Thus we choose again 
to approximate the fitting function. 

We first observe that H n = F (5) [given in Eq. (|Al0j) ] 
is numerically close to twice H12 = F^ [given in 
Eq. (|A11[) ]. This proportionality is exact if taste- 
splittings are ignored. Thus we choose to drop Hn and 
keep H12 as a surrogate for both these functions. 

Next we note that H13 — Fy^ is very similar to 

H14 = Fj^' — differing only in the taste of the PGBs 
whose masses appear in the function. Thus we choose 
to keep only if 14 from this pair. 

(2) (2) 
Finally, we come to H i5 — F v ' and Hi 6 = F^ '. These 

are also numerically similar, but, more importantly, their 
coefficients are proportional to <5^ A2 and cry A2 , quanti- 
ties that we have argued are substantially suppressed (see 
Sec. Ill A| ). Thus we consider these contributions to be of 
NNLO and simply drop them. 

The outcome of these considerations is that the func- 
tional form we use when fitting our entire data-set con- 
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tains 7 terms: 



/th = ^2 CiFi 



(63) 



where the first 4 terms are the same as in Eq. (|62|) and the 
rest are F 5 = (the analytic term), F 6 = Hi 2 = F^ 

and F 7 = H u = We expect C5 = 0(1) and cq ~ C4, 

while C7 = 614 is given by the estimates in Eq. (|40|) . In 
some of our fits we impose the latter two estimates using 
the Bayesian method. 



D. SU(3) fitting: Implementation 




In this section, we use our fits to the C3 and SI en- 
sembles as examples to explain our detailed strategy and 
the results. 

We first fit the degenerate data to the 4-parameter 
form of Eq. ((62)) with no constraints. We label this fit 
"D-U" , with D for degenerate and U for unconstrained @ 

We show the result of this fit on the C3 and SI en- 



FIG. 9. Bk (NDR, fi — 1/a) vs. m 2 K for degenerate kaons on 
the SI ensemble showing a D-U fit. Notation as in Fig. [5] 



matrix and thus is expected to be much less than unity. 
In fact, a value approaching unity indicates a poor fit. 
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FIG. 8. Br-(NDR, fi — 1/a) vs. m 2 K for degenerate kaons 
(m x — m y ), on the C3 ensemble. A fit of type D-U is shown 
by the solid (blue) curve. The dotted (red) curve labeled 
"NLO" shows the impact of dropping the NNLO term. The 
dashed (brown) curve labeled "No F4" shows the impact of 
dropping the F4 term. 

scmblcs in Figs. [8] and [9l respectively. These ensembles 
have, to good approximation, the same sea-quark masses 
and thus differ mainly in the value of a. The fits give a 
good representation of the data, as is the case also on the 
other ensembles (not shown). The parameters from the 
D-U fits are collected in Table IxFI We stress again that 
the x 2 /dof uses only diagonal elements of the covariance 



8 This fit is called "D-T3" in our previous publications. 

9 Ensemble C3-2 is only used for an estimate of finite-volume er- 
rors, as described in Sec. IIII HI 



TABLE X. Parameters of D-U fits. The d are defined in 
Eq. (f6"2")l . x 2 /dof represents \ 2 P er degree of freedom calcu- 
lated using only diagonal elements of the covariance matrix. 



ID 


Cl 


C2 


C3 


c 4 (GeV 2 ) 


X 2 /dof 


CI 


0.283(41) 


0.88(40) 


0.24(41) 


0.0018(19) 


0.004(20) 


C2 


0.351(42) 


0.18(41) 


0.85(41) 


-0.0022(19) 


0.012(17) 


C3 


0.339(13) 


0.37(13) 


0.72(13) 


-0.0008(6) 


0.041(54) 


C4 


0.327(13) 


0.47(13) 


0.60(13) 


-0.0005(6) 


0.051(59) 


C5 


0.286(32) 


0.84(30) 


0.22(30) 


0.0013(14) 


0.08(11) 


Fl 


0.349(39) 


0.05(33) 


1.06(40) 


-0.0012(11) 


0.10(12) 


F2 


0.356(32) 


0.01(27) 


1.13(33) 


-0.0013(9) 


0.11(13) 


SI 


0.323(25) 


0.18(21) 


0.88(29) 


-0.0005(6) 


0.05(12) 



We find that ci_3 are of 0(1), indicating that the chiral 
expansion is behaving as expected. The convergence of 
the ChPT series is illustrated in the figures. At LO, Bk 
is a constant. The NLO form is shown by the (brown) 
dashed curves, and has curvature due to the chiral loga- 
rithms in Fi and Fa ■ Our phenomenological NNLO term 
makes up the difference between the full fit and the NLO 
form. The convergence is reasonable for ttik < m^ 5 , 
(e.g. on C3 we find LO:NLO:NNLO « 1 : 0.5 : 0.12 for 
mx ~ ™^ lys 0j but clearly poor for our heaviest kaon. 
We stress, however, that, because we do not have the 
complete NNLO expression, this analysis of convergence 
is only approximate. 

Since ci_3 are continuum parameters, and are dimen- 
sionless, we would expect approximate consistency be- 
tween their values in all the fits, up to a modest depen- 
dence on lattice spacing and on sea-quark masses. This 
expectation holds within errors — although we note that 
the errors in C2 and C3 are quite large. 

The coefficient C4 multiplies the sole NLO discretiza- 
tion/truncation term in our fit function. We find that 
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our fits do not require such a term — it is consistent with 
zero on all ensembles. The range of values of C4 allowed 
by the fits (as indicated by the errors) are such, how- 
ever, that this term can make a noticeable contribution. 
This is illustrated in the figures, where the contribution 
of the C4 term is given by the difference between the 
full fits and the "No F4" curves. It is seen to be rela- 
tively small (< 10%) over the entire mass range on the 
C3 ensemble, and in particular is considerably smaller 
than continuum-like NLO terms (and comparable to the 
NNLO terms) at m^ lys . As expected, its contribution on 
the superfine ensemble is generally much smaller than on 
the coarse ensemble. These results imply that we have 
been conservative in treating this term as of NLO in our 
power-counting. 

Another way of seeing this is to compare the values 
of C4 (and their errors) to the magnitudes expected from 
power counting. As noted above, these range from 



C4 



0(a 2 ) » A| CD (aA| CD ) 2 



0.0031 GeV on the coarse lattices, 
0.0015 GeV 2 on the fine lattice, (64) 
0.00075 GeV 2 on the superfine lattice. 



(where we have used Aqcd = 0.3 GeV) to 
0(a 2 ) 



C4 



0.010 GeV 
0.0069 GeV 2 
0.0050 GeV 2 



on the coarse lattice 

on the fine lattice 

on the superfine lattice 



(65) 



(where we have evaluated a at the scale y, = 1/a). Our 
results for \a ± <5c4 1 are at or below the smaller of these 
estimates on all ensembles. 

We have also considered the effect of constraining C4 
using Bayesian fits. Specifically, following Ref. [46||, we 
augment the usual x 2 with an additional term: 



XI 



Xprior 



(c 4 



Xprior 



(66) 
(67) 



Since we have no prior information on the sign of C4 we 
set 0,4 = 0. We do have prior information on the ex- 
pected size of C4 = 610, given in Eq. (1551) . and we choose 
04 accordingly. In particular we consider two Bayesian 
fits, one assuming that discretization errors dominate 
[04 ~ Aq CD (aAQ CD ) 2 ], the other assuming that trun- 



A 



,a 1. We label these 



cation errors dominate [a 4 ~ ^qcd"- 
"D-Bl" and "D-B2", respectively!^ The numerical val- 
ues of 0-4 are given in Eqs. and (p5|) above. 

We expect that adding these constraints will have little 
impact on the fits, given that the unconstrained fits yield 



10 These correspond to D-BT7 and D-BT7-2 in our previous publi- 
cations. 



values of C4 and 604 that are mostly consistent even with 
the stronger of the constraints. This is borne out by the 
results of the fits. We show the results for the D-Bl case 
in Table IXll There is no significant change in ci_3, but 
the central values, and the errors, in C4 have been reduced 
on some ensembles (with a corresponding increase in the 
augmented % 2 ). As expected, the less constrained D-B2 
fits are almost identical to the D-U fits, so we do not 
show them. 

TABLE XI. Parameters of D-Bl fits. 



ID 


Cl 


C2 


C3 


c 4 (GeV 2 ) 


Xaug/dof 


Cl 


0.300(25) 


0.71(23) 


0.42(23) 


0.0011(12) 


0.044(72) 


C2 


0.328(23) 


0.39(22) 


0.61(21) 


-0.0013(12) 


0.068(99) 


C3 


0.338(12) 


0.38(12) 


0.70(12) 


-0.0008(6) 


0.053(70) 


C4 


0.326(12) 


0.48(12) 


0.59(12) 


-0.0005(6) 


0.057(54) 


C5 


0.295(23) 


0.76(21) 


0.31(21) 


0.0009(10) 


0.11(15) 


Fl 


0.327(21) 


0.24(17) 


0.81(20) 


-0.0007(6) 


0.17(22) 


F2 


0.338(21) 


0.15(17) 


0.93(20) 


-0.0009(6) 


0.20(22) 


SI 


0.311(13) 


0.28(11) 


0.73(15) 


-0.0003(3) 


0.11(21) 



We now turn to fits to our full data set, i.e. to all 
55 degenerate and non-degenerate points. We first fit 
to the functional form (|63|) without constraints, labeling 
the fits "N-U" (with N for non-degenerate, U for uncon- 
strained) 03 The parameters of the N-U fit for all ensem- 
bles are given in Table IXII| and the quality of the fit is 
illustrated by Figs. [10] and QTJ To better display the 
fit, we plot Bk versus Xp rather than G = m 2 K . This 
has several advantages: it spreads the points out, giv- 
ing a sense of the quality of the entire fit; it displays the 
dependence on the light valence quark mass m x oc Xp, 
which is the critical parameter which must be extrapo- 
lated in order to reach the physical point; and it allows 
a more direct comparison with the SU(2) "X-fits" pre- 
sented below. The fits are reasonable — as is the case 
on all other ensembles. The C3 and C4 ensembles have 
larger x 2 /dof because the statistical errors are substan- 
tially smaller, which brings to light the inadequacies in 
our partial-NNLO fit formal 

We also include in the figure the value of 
Bk(NDR, fi = 1/a) extrapolated to the physical quark 
masses using the fit function, with all taste-breaking dis- 
cretization and truncation errors removed^^ This "phys- 
ical" Bk is plotted at the physical value ol Xp. We also 



11 This corresponds to N-T2 in our previous publications. 

12 As noted by the MILC collaboration, more precise lattice data 
requires the use of ChPT expressions of higher order, with the 
order chosen so the error from the truncation of the series is 
reduced to the level of the statistical errors. 

13 In detail, we remove the terms induced by lattice artifacts by 
setting C4 = ce = C7 = 0, remove all taste splittings in sea 
and valence mesons (Ab = 0), and choose physical valence- 



quark masses by setting Yb 



G = K B 



,phys> and X B 



m 2 - . 

ss ,pnys 

= 2m|, . 

-n-O.phys 



(0.6858 GeV) [38j> 

■ m L,phy S ( tlle lat " 
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TABLE XII. Parameters from N-U fits. The d are defined in Eq. (|63[) . and x 2 /dof is the uncorrelated x 2 P er degree of freedom. 
The last column gives the value for i?K(2GeV) that results when we set quark masses to their physical values and remove 
discretization errors (as described in the text) and then run the resulting Bk (l/ a ) d° wn to 2 GeV. 



ID 


Cl 


C2 


C3 


C4{Vje\ ) 


c 5 


eg (^ev ) 


C7^^rev ) 


X / dot 


£*K\{J> — ibev ) 


Cl 


0.275(38) 


0.96(37) 


0.15(37) 


0.0021(17) 


0.050(34) 


-0.0054(37) 


0.0012(5) 


0.0063(80) 


0.616(30) 


C2 


0.351(38) 


0.17(37) 


0.86(37) 


-0.0022(18) 


0.103(36) 


-0.0051(42) 


0.0015(6) 


0.0039(38) 


0.602(33) 


C3 


0.331(12) 


0.46(12) 


0.63(12) 


-0.0004(6) 


0.057(11) 


-0.0025(12) 


0.0008(2) 


0.135(41) 


0.596(10) 


C4 


0.319(12) 


0.56(12) 


0.51(12) 


-0.0001(6) 


0.062(11) 


-0.0034(12) 


0.0008(2) 


0.196(47) 


0.603(10) 


C5 


0.285(29) 


0.85(28) 


0.21(28) 


0.0014(13) 


0.069(24) 


-0.0052(27) 


0.0008(3) 


0.040(30) 


0.621(22) 


Fl 


0.330(37) 


0.21(31) 


0.87(38) 


-0.0007(10) 


0.053(25) 


-0.0017(23) 


0.0006(3) 


0.038(33) 


0.564(20) 


F2 


0.343(30) 


0.10(25) 


1.00(31) 


-0.0010(9) 


0.039(18) 


0.0000(18) 


0.0003(2) 


0.034(30) 


0.550(16) 


SI 


0.316(23) 


0.23(20) 


0.80(26) 


-0.0004(5) 


0.072(15) 


-0.0031(12) 


0.0005(2) 


0.018(22) 


0.581(12) 



include physical Bk values in Table Kill and subsequent 
tables for other fits, except that in the tables we run them 
to a common scale, — 2 GeV, so as to allow comparison 
between ensembles. 

We stress that we are only able to remove taste- 
breaking discretization errors because we have used a 
staggered ChPT fit in which such breaking is explicit. We 
also note that taste-conserving discretization and trunca- 
tion errors are not removed by this procedure, and must 
be dealt with separately. 

The values of the parameters C\ — C4 given in Table IXlTl 
are consistent with those from the fits to the degenerate 
data alone. This is a non-trivial result, suggesting that 
our truncation of the full NLO SU(3) expression down 
to 7 parameters can adequately describe the data. The 
"continuum non-degenerate" parameter C5 appears to be 
non-zero (this is most significant on the high-statistics 
ensembles C3 and C4), and is consistent across all ensem- 
bles. The "lattice non-degenerate" parameters Cq and oj 
appear also to be non-zero on the high-statistics ensem- 
bles C3 and C4, although they are consistent with zero on 
other ensembles. Our errors are too large to tell whether 
eg and C7 show the expected decrease as the continuum 
limit is approached. 

The most striking result, however, is that C5 is very 
small. It is expected to be of order unity, but turns out 
to be more than an order of magnitude smaller. We do 
not know of an explanation for this smallness. Indeed, 
its value is not of direct physical significance, since it de- 
pends on the scale chosen in the chiral logarithms (which 
we have taken to be /iDR = 0.77 GeV). Nevertheless, 
there is an important practical consequence of our find- 
ing that C5 is small. If C5 vanished, then fits to the de- 
generate data alone would allow a determination of the 
physical Bk, since F§ is the only continuum term in our 



ter assuming the observed linear dependence on quark masses, 
Eqs. J6}). In addition we choose physical sea-quark masses by 
setting L = m* phyB and S = rnj <o phyE . Here we are assum- 
ing Dashen's theorem, namely that there are no electromagnetic 
corrections to the neutral pion and kaon masses. Other choices 
for tuning the quark masses lead to results that differ by much 
less than the statistical errors. 



1 1 1 1 I 1 1 1 1 I 1 1 1 1 I 1 1 1 1 I 1 1 1 1 




I 1 1 1 1 I 1 1 1 1 I 1 1 1 1 I 1 1 1 1 I 1 1 1 1 I 

0.1 0.2 0.3 0.4 0.5 

X p (GeV 3 ) 



FIG. 10. B K (NDR,fi = l/a) as a function of X P on the C3 
ensemble. For each value of Xp oc m x , the tower of points 
corresponds to the different values of m y , with m v increasing 
from m x at the bottom to m™ ax = 0.05/a at the top. Curves 
show the N-U fit, both for fixed values of m y and for m x — m y . 
Error bands are not included for the sake of clarity. The (blue) 
octagon gives Bk{1/o) evaluated at the physical kaon mass 
with lattice artifacts removed (as described in the text). 



fit function that vanishes for degenerate quark masses. 
Since we find C5 to be very small, the degenerate data 
must play an important role in determining the extrap- 
olated, physical Bk- 

The magnitude of the parameter c§ is expected to lie in 
the same range as C4, namely from Aq CD (aAQCD) 2 (dis- 
cretization errors) to Aq CD q; 2 (truncation errors). Nu- 
merical values for these ranges are given in Eqs. (|64j) and 
(pfj)) above. We see from Table IXlTl that some of the N-U 
fit values of Cq exceed the smaller of these expectations, 
though not the larger. As for Oj, if discretization errors 
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0.1 0.2 0.3 0.4 0.5 

X p (GeV 2 ) 

FIG. 11. B A '(NDR,/i = l/a) as a function of X P on the SI 
ensemble. Notation as in Fig. [10] 



dominate we expect 

c 7 « A^ CD (aA QC D) 2 

f 0.00027 GeV 4 (coarse) 
w J 0.00014 GeV 4 (fine) (68) 
[ 0.000068 GeV 4 (superfine) 

while if truncation errors dominate we expect 

c 7 ~ A QCD Q s 

f 0.00090 GeV 4 (coarse) 

« < 0.00062 GeV 4 (fine) (69) 

[ 0.00045 GeV 4 (superfine) 

In this case, some of the fit values of c-j are larger than 
both of these expectations. 

We have thus carried out Bayesian fits simultaneously 
enforcing the expected sizes of C4, eg and C7. As in the 
degenerate case, \ 2 is augmented by 

2 ^ ^ (^2 ^2) / \ 

Xprior / j ' \'"J 

j=4,6,7 ' 

We set 114,6,7 = 0, and make two choices for the <7j. The 
first assumes that discretization errors dominate, 

0-4 = ct 6 = Aq CD (aAQ C D) 2 , cf-7 = AQ CD (aAQ C D) 2 , 

while the second assumes that truncation errors domi- 
nate, and leads to the weaker constraints: 

ct 4 = <t 6 = A^pja 2 , 07 = AQ CD a 2 . (72) 



We label these fits N-Bl and N-B2, respectively^ 



0.6 


1111,111 
A\vi^~^ — f — 


,1111,1111, 






u\NKT~ — $ — * — 

ivv^r* — 






0.5 






- 


0.4 


1 < 1 1 1 1 1 1 


SxPT, N-Bl 

, 1 . . . 1 1 . . , , 1 





0.1 0.2 0.3 0.4 0.5 

X p (GeV 2 ) 

FIG. 12. Bk(X/ol) vs. Xp on the C3 ensemble, showing the 
N-Bl fit. Notation as in Fig. [10] 

The N-Bl fit on the C3 ensemble is shown in Fig. [121 
The fit appears to be of comparable quality to that in 
Fig. [10] — the somewhat larger \ 2 indicates, however, that 
the constraints are having a non-trivial impact. The most 
notable changes compared to Fig. [10] are that the curva- 
ture at small Xp (and fixed m y ) is smaller, and that the 
value of flif(l/a) is reduced. Similar changes are seen on 
all ensembles. 

The parameters from the N-Bl and N-B2 fits are given 
in Tables IXIIII and IXIVI The values of c\ — C4 from the 
two fits are consistent with each other and with those 
from the N-U fits. For several ensembles, however, C5 
is not consistent between the fits, most notably on the 
C3 and C4 ensembles. This suggests that the errors are 
underestimated and/or that the truncated fit function 
does not fully represent our data. We conclude only that 

|cb| < 1- 

Our tentative conclusion of a non-zero eg from the N- 
U fit is not confirmed by the two new fits. Results of 
both signs are found. On most ensembles the results 
from the three fits are consistent with each other, and 
with a vanishing result. On C3 and C4, there are some 
marginal inconsistencies. We conclude that we cannot 
reliably determine eg. The situation for c-j is slightly 
better, since all three fits are consistent within 2-<r, and 
all have the same sign. 

The results for B K {2 GeV) from the three fits (N-U, 
N-Bl and N-B2) are not consistent. The differences are 



They correspond respectively to N-BT8 and N-BT8-2 in our pre- 
vious publications. 
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TABLE XIII. Parameters of N-Bl fits. Notation as in Table [XlTl 



ID 


Cl 


C2 


C3 


c 4 


C5 


C6 


C7 


2 / l r 
Xaug/dof 


Bk — 2 GeV) 


CI 


0.265(30) 


1.05(29) 


0.06(29) 


0.0026(14) 


-0.033(23) 


0.0021(9) 


0.0002(1) 


0.067(36) 


0.557(12) 




U.olz^ol ) 


U.OD^oU ) 


n a ^c\\ 

U.4D^oU ) 


— U.UUUO^lO ) 


— U.UUO^Z I ) 


u.uuoo^y ) 


U.UUUo^l ) 


u. iuo^oo ) 


U.04U^lo J 


C3 


0.321(12) 


0.55(12) 


0.53(12) 


0.0000(6) 


0.005(7) 


0.0017(3) 


0.0004(1) 


0.22(40) 


0.560(5) 


C4 


0.309(12) 


0.65(12) 


0.40(12) 


0.0003(6) 


0.004(7) 


0.0013(3) 


0.0004(1) 


0.283(41) 


0.562(4) 


C5 


0.275(27) 


0.94(25) 


0.11(26) 


0.0017(12) 


-0.006(19) 


0.0009(7) 


0.0002(1) 


0.072(39) 


0.567(10) 


Fl 


0.306(29) 


0.42(24) 


0.61(29) 


-0.0001(8) 


0.008(19) 


0.0014(5) 


0.0001(0) 


0.103(57) 


0.533(10) 


F2 


0.327(26) 


0.24(22) 


0.82(26) 


-0.0006(8) 


0.020(17) 


0.0012(4) 


0.0001(0) 


0.080(51) 


0.539(9) 


SI 


0.296(18) 


0.41(16) 


0.55(21) 


0.0000(4) 


0.011(12) 


0.0005(2) 


0.0001(0) 


0.094(56) 


0.533(7) 



TABLE XIV. Parameters of N-B2 fits. Notation as in Table IXlTl 



ID 


Cl 


C2 


C3 


C4 


C5 


C6 


C7 


Xaug/dof 


B K (n = 2GeV) 


Cl 


0.263(38) 


1.07(37) 


0.03(37) 


0.0026(17) 


-0.027(20) 


0.0010(11) 


0.0004(2) 


0.021(12) 


0.562(14) 


C2 


0.333(39) 


0.35(38) 


0.67(38) 


-0.0014(18) 


0.003(23) 


0.0030(11) 


0.0006(2) 


0.028(15) 


0.533(14) 


C3 


0.328(12) 


0.48(12) 


0.60(12) 


-0.0003(6) 


0.041(8) 


-0.0012(9) 


0.0007(1) 


0.151(39) 


0.585(8) 


C4 


0.316(12) 


0.58(12) 


0.48(12) 


0.0000(6) 


0.045(8) 


-0.0020(9) 


0.0007(1) 


0.214(44) 


0.591(8) 


C5 


0.277(30) 


0.93(29) 


0.13(29) 


0.0016(13) 


0.019(13) 


-0.0013(11) 


0.0004(2) 


0.050(31) 


0.585(11) 


Fl 


0.321(38) 


0.28(32) 


0.77(39) 


-0.0005(11) 


0.011(15) 


0.0010(7) 


0.0003(1) 


0.047(33) 


0.532(8) 


F2 


0.340(31) 


0.13(26) 


0.96(31) 


-0.0009(9) 


0.024(11) 


0.0010(7) 


0.0002(1) 


0.038(32) 


0.539(6) 


SI 


0.309(23) 


0.30(20) 


0.71(27) 


-0.0003(5) 


0.038(8) 


-0.0012(6) 


0.0003(1) 


0.036(23) 


0.554(5) 



most significant on the C3 and C4 ensembles, where they 
exceed 3-c. This is caused by the poor determination of 
some of the fit parameters, particularly c.5 and eg. Al- 
though all the fits give a reasonable representation of the 
data, the results obtained after extrapolation to physical 
quark masses and removal of taste-breaking differ. This 
is because C5 and cq terms have significantly different de- 
pendence on quark masses and taste breaking. 

To do better, we recall that, because of the smallness 
of C5, we can almost determine the physical Bk using 
the degenerate data alone. Furthermore, the D-U, D-Bl 
and D-B2 fits give consistent results for ci_4, which we 
take to indicate that these results are reliable. Thus, in 
our last stage of SU(3) fitting, we use the results from 
the degenerate fits as constraints on the parameters ci_4 
when doing fits to the full data set. We dub these "dou- 
ble Bayesian" fits, and think that they provide the most 
reliable way of determining the parameters C5 and c% . 

In more detail, we first do a D-Bl or D-B2 fit to de- 
generate points as described above. The central values 
and errors from these fits are then used as constraints 
on ci_4 in a second fit to the full data-set, along with 
the constraints on just described. In other words, if 
the first fit yields a.i ± &i for i = 1,4, then the second fit 
augments the x 2 by 

Xprior — Aprior (1) + Aprior (2) (73a) 

2 _ (pi ~~ a i) 2 /7CfU\ 

Xprior (1) — / , ^2 K'^ ) 

1=1 1 

Aprior (2) = X/ ~^2~ 
i=6,7 * 



with 06,7 given either by Eq. (I7T|) . if the first fit was D- 
Bl, or by ([72) otherwise. We label these fits N-BB1 and 
N-BB2, respectively!!! 

Examples of the resulting fits are shown in Figs. [T31 and 
[14l and fit parameters from all ensembles are collected in 
Tables KV] and KVD 

I 1 1 1 1 I 1 1 1 1 I 1 1 1 1 I 1 1 1 1 I 1 1 1 1 I 




I i i i i I i i i i I i i i i I i i i i I i i i i I 
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X p (GeV 3 ) 



FIG. 13. Bk (1/a) vs. Xp on the C3 ensemble, showing the 
N-BB1 fit. Notation as in Fig.fTOl 



These correspond respectively to the N-BT7 and N-BT7-2 fits in 
our previous publications. 
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I i i i i I i i i i I i i i i I i i i i I i i i i I 

0.1 0.2 0.3 0.4 0.5 

X p (GeV 3 ) 



FIG. 14. Bpc(l/a) vs. Xp on the C3 ensemble, showing the 
N-BB2 fit. Notation as in Fig.fTUl 

The values of the Ci, the augmented x 2 , and Bk, are 
consistent between fits N-Bl and N-BB1, and between 
N-B2 and N-BB2. The major changes induced by using 
two levels of constraints are that the errors in c\ — C3 are 
reduced, while those in Bk are increased. The former 
is expected, since we are placing constraints on c\ — C3 
in the second stage of fitting. The latter is indicative of 
the uncertainties in the fit parameters induced by dis- 
cretization errors, particularly in cq. With c\ — C3 pinned 
down, a greater range of cq values are explored, leading 
to more variation in the artifact subtraction needed to 
obtain the physical Bk- Since this variation represents 
a genuine systematic error in our procedure, and is best 
represented by the N-BB fits, we choose the N-BB fits 
for our central values. Specifically, we choose fit N-BB1, 
using N-BB2 and N-Bl to estimate different fitting sys- 
tematics. 

We see from tables that the difference between the 
physical Bk obtained from the N-BB1 and N-BB2 fits 
can be as large as 6% on the coarse and superfine lattices. 
To understand this difference in more detail, Table IXVlII 
gives a breakdown of the contributions to Bk on both the 
C3 and SI ensembles. Here, valence quark masses have 
been set to their physical values, but taste-splittings have 
not been removed. In other words, this is the result of 
using the SChPT fit form to extrapolate the lattice data 
on this ensemble to the physical valence-quark massesP^I 
The chiral series shows poor convergence, with NLO:LO 
~ 2/3, although the NNLO term is of reasonable size 
(NNLOTO « 0.13). We see also that the lattice artifacts 



We stress that the values for Bk (1/a) differ from those in Figs. ll3l 
and 1141 because the latter have taste splittings set to zero. 



(given in the NLO-04^.7 column) amount to no more than 
a quarter of the total NLO contribution on the coarse 
ensemble, and significantly less on the superfine ensem- 
ble. The major difference between the two N-BB1 and 
N-BB2 fits is in the relative size of these lattice artifacts 
and the continuum C5 contribution. Since the artifacts 
are removed when we quote a physical value, it is this 
difference which is mainly responsible for the differences 
one sees in the physical Bk between the N-BB1 and N- 
BB2 fits. Given our data, it appears difficult to resolve 
this uncertainty. 

We close this subsection by further illustrating the im- 
pact of lattice artifacts. Figures fTS] and [TB] show the 
dependence of Bk on Xp oc m x for the N-BB1 fit on 
the C3 and SI ensembles, with m y set to the physi- 
cal strange-quark mass. The three curves on each plot 
show the fit function itself (solid), the fit function with 
lattice- artifact terms removed (dashed), and the fit func- 
tion with all taste-violating lattice artifacts removed, and 
sea-quark masses set to their physical values (dotted). 
The points with errors indicate the resulting "physical" 
Bif(l/a). Only the "SU(2) regime" is shown, for which 
m x /mP hys < 1/2. We observe that the size of the to- 
tal taste- violating discretization errors (the difference be- 
tween the solid and [red] dotted curves) is moderate on 
the course lattices (6% or less for m x > mf ys ), and 
somewhat reduced on the superfine lattices. 

We also observe that the curvature in the fit functions 
is almost entirely due to the lattice-artifact terms. Re- 
moving these gives the (blue) dashed curves, which show 
much less curvature. The chiral logarithms that remain 
if lattice- artifact terms are removed are proportional to 
log(JCr), and thus do not diverge in the chiral limit be- 
cause Xi is non-vanishing in this limit due to taste split- 
ting. After removing taste splitting (giving the [red] dot- 
ted curves), there is a logarithmically divergent term (an 
SU(2) partially quenched chiral logarithm), but it has a 
small coefficient and is barely visible. 

We summarize our experience with SU(3) fitting as 
follows. The fits are reasonably successful at reproduc- 
ing the data, but have two major drawbacks. First, the 
convergence of the chiral series is suspected for physical 
quark masses, which undermines our use of SChPT to re- 
move unphysical lattice artifacts. Second, our fits do not 
determine the size of the lattice-artifact terms, leading 
to significant differences between the "physical" results 
from different fits. To some extent these two drawbacks 
"cancel" , and we can hope that the systematic error we 
estimate by comparing fits takes into account the un- 
certainties in removing the artifacts. Nevertheless, the 
reliance on SU(3) SChPT is unsatisfactory. 

E. SU(2) fitting 

We now turn to our second fitting strategy, in which 
we fix m y close to mP hys , extrapolate m x to the phys- 
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TABLE XV. Fit parameters for N-BB1 fits. Notation is as in Table IXTJ 



ID 


Cl 


C2 


C3 


c 4 


C5 


C6 


C7 


xLg/dof 


B K (p = 2GeV) 


CI 


0.2944(54) 


0.767(45) 


0.386(36) 


0.0015(5) 


-0.017(15) 


0.0023(9) 


0.0002(1) 


0.084(61) 


0.555(12) 


C2 


0.3269(51) 


0.409(41) 


0.623(34) 


-0.0011(6) 


0.004(16) 


0.0036(8) 


0.0003(1) 


0.108(54) 


0.538(12) 


C3 


0.3345(28) 


0.418(27) 


0.677(20) 


-0.0006(2) 


0.017(5) 


0.0013(5) 


0.0004(1) 


0.250(55) 


0.562(6) 


C4 


0.3229(27) 


0.514(26) 


0.560(19) 


-0.0003(2) 


0.016(5) 


0.0009(5) 


0.0004(1) 


0.320(72) 


0.564(5) 


C5 


0.2913(51) 


0.790(43) 


0.285(32) 


0.0011(5) 


0.003(12) 


0.0009(6) 


0.0002(1) 


0.078(61) 


0.567(10) 


Fl 


0.3243(47) 


0.253(28) 


0.825(35) 


-0.0005(3) 


0.015(12) 


0.0015(5) 


0.0001(0) 


0.111(55) 


0.535(9) 


F2 


0.3367(49) 


0.155(32) 


0.939(30) 


-0.0008(3) 


0.024(10) 


0.0013(4) 


0.0001(0) 


0.078(45) 


0.540(8) 


SI 


0.3088(33) 


0.293(19) 


0.727(28) 


-0.0002(2) 


0.016(7) 


0.0006(2) 


0.0001(0) 


0.105(63) 


0.535(6) 



TABLE XVI. Fit parameters for N-BB2 fits. Notation is as in Table IXIll 



ID 


Cl 


C2 


C3 


C4 


C5 


C6 


C7 


Xaug/dof 


B K (v = 2GeV) 


Cl 


0.2801(92) 


0.905(86) 


0.217(67) 


0.0019(7) 


-0.013(15) 


0.0006(17) 


0.0005(2) 


0.026(26) 


0.564(17) 


C2 


0.3444(87) 


0.235(82) 


0.795(59) 


-0.0019(7) 


0.013(15) 


0.0027(17) 


0.0007(2) 


0.029(20) 


0.535(17) 


C3 


0.3369(27) 


0.395(26) 


0.700(19) 


-0.0007(2) 


0.056(15) 


-0.0020(15) 


0.0008(2) 


0.164(42) 


0.592(14) 


C4 


0.3251(26) 


0.493(25) 


0.580(18) 


-0.0004(2) 


0.060(14) 


-0.0029(15) 


0.0008(2) 


0.227(56) 


0.598(13) 


C5 


0.2851(68) 


0.850(61) 


0.213(50) 


0.0013(5) 


0.027(17) 


-0.0017(20) 


0.0004(2) 


0.052(40) 


0.588(19) 


Fl 


0.3409(92) 


0.112(69) 


0.996(65) 


-0.0010(4) 


0.025(12) 


0.0006(12) 


0.0004(2) 


0.055(31) 


0.539(12) 


F2 


0.3511(76) 


0.030(56) 


1.095(50) 


-0.0012(4) 


0.034(13) 


0.0006(14) 


0.0003(2) 


0.042(28) 


0.545(13) 


SI 


0.3189(63) 


0.207(46) 


0.843(52) 


-0.0005(2) 


0.048(11) 


-0.0015(10) 


0.0003(1) 


0.042(25) 


0.560(11) 



B K 

0.62 r 




B K 




FIG. 15. B K (l/a) vs. X P (in GeV 2 ) on the C3 ensemble from 
the N-BB1 fit. The solid (black) curve shows the fit function 
with m y = ms hys , the dashed (blue) curve shows the result of 
removing the lattice-artifact terms (i.e. those proportional to 
C4, C6 and or), and the dotted (red) curve shows the "physical" 
Bk, i.e. with lattice artifacts and taste-splittings removed, 
and with the sea-quarks set to their physical values. The 
data-point shows the result for m x = m^ hys . 



ical light-quark mass, and then extrapolate m y to the 
physical strange quark mass. At all stages we need 
m x < m y ~ mf ys in order that SU(2) SxPT can be 
used. The fitting function is given in Eqs. (I41j|45p . and 
has the key feature that there are no terms arising solely 
from discretization or truncation errors. 

On each ensemble, we extrapolate to mP hys using our 
three heaviest y quarks (e.g. am y — {0.04, 0.045, 0.05} 
on the coarse lattices). We label this choice "3Y". The 



FIG. 16. As in Fig. [15] but for the SI ensemble. 



extent of the extrapolation varies substantially between 
the ensembles, as can be seen by comparing Tables IVl 
and |Vll Our largest values of am y lie 3%, 15% and 24% 
below the physical strange quark mass, on coarse, fine 
and superfine lattices, respectively. 

For the chiral extrapolation we use either our light- 
est 4 or 5 values of m x , choices we label 4X and 5X, 
respectively. For the "4X3Y" choice, which leads to 
our preferred fit, the SU(2) expansion parameter satis- 
fies m x /m y < 1/2. 

For fixed sea-quark masses, the fitting function (|4"Tj) 
has three terms. The first two are present at NLO, while 
the third is of NNLO. We have thus tried two types of 
"X-fits": NLO (two parameter) and NNLO (three pa- 
rameter) . 

We begin by considering the 4X3Y-NLO fits. The first 
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TABLE XVII. Breakdown of contributions to B K in the SChPT N-BB1 and N-BB2 fits on the C3 and SI ensembles. Valence 
quark masses are set to their physical values. 



Ens. 


Fit 


B K (l/a) 


LO 


NLO-C!_ 2 


NLO-cs 


NLO-C4, C6,7 


NNLO-C3 


C3 


N-BB1 


0.598 


0.335 


0.157 


0.013 


0.052 


0.042 


C3 


N-BB2 


0.609 


0.337 


0.151 


0.045 


0.033 


0.043 


SI 


N-BB1 


0.530 


0.309 


0.144 


0.012 


0.020 


0.045 


SI 


N-BB2 


0.529 


0.319 


0.125 


0.038 


-0.006 


0.052 



stage is the "X-fit", which we do separately for the 3 
largest values of am y . The fits for the largest am y on 
ensembles C3 and SI are shown, respectively, in Figs. [T7] 
and[THJ The (mild) curvature in the fit function is due 
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17. _Bjf(l/a) versus Xp on coarse ensemble C3, for 
am y = 0.05. The 4X NLO fit is shown by the solid (black) 
line. The horizontal (blue) line gives the contribution of the 
LO term. The (magenta) dashed curve has taste-splittings 
removed, while the (red) dotted curve shows the result of 
removing taste-splittings and setting the the light sea-quark 
to its physical value. The (red) point on this line is at Xp = 
m% a and shows the errors from the fit. 
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FIG. 18. Bk{X/ol) versus Xp on the superfine ensemble SI, 
for am y = 0.018. The data point at the largest Xp is not 
included in the fit. Notation as in Fig. 1171 

to the chiral logarithms. As in the SU(3) fit, these do 



not diverge in the chiral limit because of taste-breaking. 
The figures also show the relative size of the LO and 
NLO contributions. The NLO contribution (difference 
between the solid [black] and horizontal [blue] lines) is 
small, not exceeding 10% over the range of the fits. The 
(magenta) dashed curves show the impact of removing 
taste-splittings from the fit function, so that the chiral 
logarithms do diverge. The impact is seen to be small 
in the region of our data, most notably on the superfine 
lattice. The (red) dotted curves show the form in the con- 
tinuum limit, with physical light sea-quarks, as predicted 
by the fit. The single point on these curves shows the re- 
sult (including errors from the fit) for m x = m^ hys . The 
difference between the fit and these "continuum" curves 
is seen mainly to be the result of setting the light sea- 
quark mass to its physical value, which alters the size 
of the chiral logarithm. This correction for unphysical 
effects is clearly dependent on our use of SChPT. 

The solid [black] and dotted [red] curves in these fig- 
ures can be compared, approximately, to the correspond- 
ing curves from the SU(3) fits shown in Figs. IT51 and [TBI 
The comparison is not exact because the values of m y 
differ (most notably for the SI ensemble). For the C3 
ensemble, the curvature in the SU(2) fit is much smaller 
than in the SU(3) fit. This is because the curvature in 
the SU(3) fits is dominated by the lattice-artifact terms, 
which are absent in the SU(2) fits, and very poorly de- 
termined in the SU(3) fits. The SU(2) fit is superior in 
that it incorporates the expectation that these artifacts 
are small in the SU(2) regime. The continuum curves 
also differ between SU(2) and SU(3) fits, with the chiral 
log being weaker in the latter. This is possible because 
of 0(m x /m y ) corrections to its coefficient that are of 
NNLO in SU(2) SChPT and thus dropped in the SU(2) 
fit. Again, the SU(2) fit is superior, since the size of these 
corrections is not known in the SU(2) limit. 

The parameters of the NLO X-fits, and the resulting 
"physical" Bk , are given (for the largest values of am y 
only) in Table IXVlIII for all ensembles. Some fits have a 
relatively high (uncorrelated) x 2 /dof, indicating that the 
NLO fit function may be insufficient. The fit parameters 
themselves are consistent on all the coarse lattices, with 
evidence for monotonic variation with a 2 . 

After repeating this procedure for the three heaviest 
values of m y , we then proceed to the "Y-fit". The re- 
sults for "continuum, physical" Bk from the X-fits are 
expected to be a smooth, analytic function of Yp oc m y . 
Examples of the data are shown in Figs. [TS] and [501 
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TABLE XVIII. Fit parameters for the 4X-NLO fits. The va- 
lence strange quark mass is fixed at our heaviest value, i.e. 
am y = 0.05, 0.03, and 0.018 on coarse, fine and superfine 
lattices, respectively. The result for Bk(1/o) is obtained 
from the fit by setting taste-splittings to zero, L = mJL and 



ID 


di 


d 2 


X 2 /dof 


B K {n = l/a) 


CI 


0.548(11) 


0.107(49) 


0.19(26) 


0.5568(96) 


C2 


0.556(12) 


0.060(54) 


0.36(37) 


0.564(11) 


C3 


0.5602(34) 


0.035(16) 


0.83(54) 


0.5676(31) 


C4 


0.5581(35) 


0.039(16) 


0.57(45) 


0.5656(31) 


C5 


0.5501(85) 


0.066(40) 


0.08(16) 


0.5582(77) 


Fl 


0.5107(86) 


0.089(45) 


0.06(14) 


0.5189(76) 


F2 


0.5146(75) 


0.066(40) 


0.03(11) 


0.5222(66) 


SI 


0.4808(50) 


0.143(30) 


0.06(14) 


0.4900(43) 



We extrapolate to the "physical" value of the ss meson, 
Yp = (0.6858 GeV) 2 [38|]. In all cases, our results arc 
consistent with a linear dependence, and so we use a fit 
to 



fph 



b 2 Y P 



(74) 



to obtain our central value. We also do a quadratic "fit" 
in order to estimate the systematic error arising from 
this extrapolation. The parameters from the linear 3Y 
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FIG. 19. B K (l/a) from the 4X-NLO fit vs. Y P , on the coarse 
ensemble C3. A linear extrapolation to the physical strange 
quark mass is shown. 



fits are collected in Table IXIXl The fits are good, and 
the parameters are consistent across all ensembles. Also 
quoted are the values of Bk obtained by extrapolation in 
Yp and subsequent running to the common scale 2 GeV. 

We have repeated this analysis using NNLO X-fits (i.e. 
keeping the Q3 term). The X-fits are shown in Figs. |2"T1 
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FIG. 20. B K (l/a) from the 4X-NLO fit, versus Y P , on the 
superfine ensemble SI. A linear extrapolation to the physical 
strange quark mass is shown. 



TABLE XIX. Parameters of 3Y fits (using input from 4X- 
NLO fits) and the resulting B K (fi = 2GeV). 



ID 


61 


b 2 


X 2 /dof 


B K (v = 2GeV) 


CI 


0.482(15) 


0.162(22) 


0.0009(6) 


0.5488(95) 


C2 


0.466(16) 


0.214(27) 


0.0026(10) 


0.556(10) 


C3 


0.4899(47) 


0.1712(73) 


0.0154(26) 


0.5599(30) 


C4 


0.4866(50) 


0.1725(75) 


0.0135(28) 


0.5577(31) 


C5 


0.476(12) 


0.179(19) 


0.0015(8) 


0.5503(75) 


Fl 


0.433(13) 


0.215(23) 


0.0017(10) 


0.5416(77) 


F2 


0.437(11) 


0.214(19) 


0.0037(15) 


0.5454(66) 


SI 


0.4061(69) 


0.233(14) 


0.0049(17) 


0.5385(46) 



and [22l for the C3 and SI ensembles, respectively. The 
resulting fit parameters are given in Tables IXXl and IXXII 



TABLE X X. Fit p arameters for the 4X-NNLO fits. Notation 
as in Table IXVIIll 



ID 



CI 
C2 
C3 
C4 
C5 
Fl 
F2 
SI 



di 



d-2 



d 3 



xVdof B K ((i = l/a) 



0.561(19) 

0.577(21) 

0.5695(60) 

0.5660(62) 

0.557(15) 



-0.13(20) 

-0.29(23) 

-0.127(67) 

-0.097(68) 

-0.05(16) 



0.88(60) 
1.31(69) 
0.61(20) 
0.50(20) 
0.42(48) 



0.031(51) 

0.090(93) 

0.16(12) 

0.12(10) 

0.037(56) 



0.565(14) 

0.577(16) 

0.573(5) 

0.571(5) 

0.562(11) 



0.517(15) 
0.519(13) 



-0.04(19) 
-0.02(17) 



0.54(66) 
0.35(57) 



0.007(25) 
0.002(14) 



0.523(11) 
0.525(10) 



0.4834(93) +0.09(13) 0.27(51) 0.075(94) 0.491(7) 



The NNLO X-fits are better able to capture the curva- 
ture seen in the data (particularly on the coarse ensem- 
bles), and have correspondingly reduced x /dof. The 
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FIG. 21. Bk(1/cl) versus Xp on coarse ensemble C3, for 
am y = 0.05. The 4X-NNLO fit is shown by the solid (black) 
line. The horizontal (blue) line gives the contribution of the 
LO term. The (magenta) dashed curve shows the fit with 
NNLO term removed. The (red) dotted curve has taste- 
splittings removed and the light sea-quark mass set to its 
physical value. The (red) point on this line is at Xp — m\ a 
and shows the errors from the fit. 




FIG. 22. _Bx(l/a) versus Xp on the superfine ensemble SI, 
for am y — 0.018. Notation is as in Fig. 1211 The data-point 
with the largest value of Xp is not included in the fit. 



convergence of the chiral series begins to break down on 
the C3 ensemble at the largest fit values of Xp, in the 
sense that NNLO : NLO w 1. The convergence on the 
SI ensembles is satisfactory for all four fit points. The 
linear Y-fits remain very good on all ensembles (and are 
not shown). 

Comparing Tables IXlXl and IXXll we see that NLO 
and NNLO fits yield "physical" Bk values that are con- 
sistent within errors. We have also done NNLO 5X3Y 
fits, and simultaneous X and Y extrapolations (rather 
than sequential), both of which give consistent results. 

Clearly, SU(2) SChPT fitting is much more straightfor- 
ward than that to the SU(3) form. No fitting parameters 
need be dropped, and no Bayesian constraints are re- 
quired. While we use only a fraction of our data (roughly 
a quarter), the points we keep are those which lie closest 



TABLE XXI. Parameters of 3Y fits (using input from 4X- 
NNLO fits), and the resulting B K {^ = 2GeV). 



ID 


6i 


b 2 


xVdof 


B K (v = 2GeV) 


CI 


■iOI C 1 Q~l 


1 39("31 ~1 


ooo?^ 


^7(14-) 


C2 


0.478(22) 


0.215(39) 


0.0012(6) 


0.569(16) 


C3 


0.5081(64) 


0.144(11) 


0.0053(18) 


0.5651(46) 


C4 


0.5080(69) 


0.137(11) 


0.0038(17) 


0.5621(47) 


C5 


0.493(17) 


0.150(27) 


0.0003(4) 


0.554(11) 


Fl 


0.443(18) 


0.199(34) 


0.0005(6) 


0.544(11) 


F2 


0.444(15) 


0.203(27) 


0.0016(11) 


0.547(10) 


SI 


0.4121(98) 


0.220(20) 


0.0020(12) 


0.5385(72) 



to the physical kaon and thus should dominate the ex- 
trapolation to the physical point. As for the SU(3) fits, 
the use of SChPT (rather than, say, an analytic extrap- 
olation) is crucial, as is allows us to remove the effect of 
taste-breaking, and to account for the shift in the chiral 
logarithm due to the use of an unphysical light sea-quark 
mass. We cannot, however, claim to have provided a 
test of the predicted chiral logarithms; instead, we have 
shown consistency with our data. 

We compare our SU(2) results with those from SU(3) 
fitting when we discuss error budgets in Sec. IIII Hi below. 



F. Dependence of Bk on sea quark masses 

The fits presented above are on individual ensembles, 
and thus at fixed values of lattice spacing and sea-quark 
masses. Both SU(2) and SU(3) SChPT predict, at NLO, 
linear dependence of Bk on the light sea-quark mass 
(once chiral logarithms have been accounted for, as has 
been done above). This enters through the H§ term in 
SU(3) SChPT and the Q A term in SU(2) SChPT. At 
NLO, this dependence can either be treated as that of 
the leading order coefficients (bi and di) or of Bk it- 
self. We take the latter approach, and investigate this 
dependence on the coarse and fine lattices, where we have 
results with more than one light sea-quark masses. We 
also discuss the impact of the mismatch of the strange 
sea-quark mass with its physical value. 

We plot Bk(h = 2GeV) on the coarse lattices from 
the N-BB1 SU(3) fits against am e in Fig.l2"l The data 
show only weak dependence on ami. We perform both 
constant and linear extrapolations to the physical value 
oiami, and find that they lead to consistent results. The 
corresponding results from the 4X3Y-NNLO SU(2) fit are 
shown in Fig. [M] Here the linear and constant fits are 
almost indistinguishable. We collect the results of these 
extrapolations in Table IXXIII 

For the fine lattices, we have two light sea-quark 
masses, corresponding approximately to ami — 0.005 
(C5) and ami = 0.01 (C3) on the coarse lattices. As on 
the coarse lattices, the data have very weak dependence 
on ami, as shown in Figs. l25l and [26l 

To quantify the size of the mass dependence, we use 
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FIG. 23. B K (fi = 2GeV) from N-BB1 SU(3) fits versus arm 
(light sea-quark mass) on MILC coarse ensembles. The data 
come from Table IXVI Linear and constant fits are shown, 
along with the resulting value after extrapolation to the phys- 
ical value of ami. The errors on the point at ami = 0.01 and 
0.007 are smaller due to the larger number of measurements 
made on the C3 and C4 ensembles. 



TABLE XXII. BkO = 2GeV) on the coarse lattices after 
extrapolation to the physical light sea-quark mass, for various 
choices of fit. 



S X PT 


valence fit 


ami fit 


B K 


SU(3) 


N-BB1 


constant 


0.5613(33) 


SU(3) 


N-BB1 


linear 


0.5680(56) 


SU(2) 


4X3Y-NNLO 


constant 


0.5628(30) 


SU(2) 


4X3Y-NNLO 


linear 


0.5624(57) 



the form expected from either SU(2) or SU(3) SChPT 

B K (am e ) = B K (am e - 0) [l - L P /A 2 ] . (75) 

The linear fit shown in Fig. [21 has A w 2.8 GeV. This is 
much smaller than the canonical size expected in ChPT, 
namely Awl GeV, indicating that the sea-quark mass 
dependence is indeed weak. We note that mass depen- 
dence with this value of A is also consistent with the fine 
lattice data. 

For the superfine lattices, we have only a single light 
sea-quark mass, and so cannot perform an extrapolation 
to arn^ hys . Instead, we assume (based on the fits above) 
that, to first approximation, there is no dependence on 
ami. We do so consistently for all three lattice spacings, 
taking ensembles C3, Fl and SI for subsequent contin- 
uum extrapolation. We use these three ensembles since 
they all have mi/m s = 1/5, with m s ~ mP hys , and thus 
have closely matched sea-quark masses. 

Clearly this procedure leads to a systematic error if 
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FIG. 24. B K (p = 2GeV) from 4X3Y-NNLO SU(2) fits versus 
ami (light sea-quark mass) on MILC coarse ensembles. The 
data come from Table IXXll Details are as in Fig. 1231 
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FIG. 25. As in Fig. [23] but for the MILC fine ensembles. 



there is, in fact, a dependence on ami. We estimate this 
error assuming that the dependence is that given by the 
linear fit of Fig. [23] Specifically, we take the difference 
between the result on ensemble C3 with that after lin- 
ear extrapolation to amf 1 ^ (as given in Table IXXII|) . 
divided by the C3 value, as our estimate of the frac- 
tional error from this source. The resulting error is 1.2% 
for the SU(3) fits, and 0.06% for the SU(2) fits. This is 
quite likely an underestimate in the SU(2) case, but since 
other sources of error are significantly larger, we have not 
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FIG. 26. As in Fig. [H but for the MILC fine ensembles. 



attempted to improve on this estimate. 



TABLE XXIII. Nominal (am s ) and actual (am^ hyB ) strange 
sea-quark masses [ilj ]. 



MILC 


am s 


am p s hys 


coarse 


0.05 


0.0350(7) 


fine 


0.031 


0.0261(5) 


superfine 


0.018 


0.0186(4) 



Finally, we should account for the difference between 
the values of am s used in the MILC ensembles and mP hys . 
These values are given in Table IXXIIII The most reliable 
approach would be to have results with more than one 
value of am s , and extrapolate/interpolate to the physical 
value. Absent this possibility, we can use SU(3) ChPT 
to estimate the resulting error. In SU(3) ChPT, the de- 
pendence on sea-quark masses is obtained from Eq. (|75[) 
by the substitution Lp — > Lp + Sp/2. Assuming this, 
and using the masses in Table IXXIIII as well as the value 
of A from the linear fit in Fig. [531 we find that the cor- 
rection to B K {2 GeV) is +0.0075, +0.003, and -0.0006 
for the coarse, fine and superhne ensembles, respectively. 
Extrapolating to the continuum linearly in a 2 leads to a 
final correction of —0.0028. We take the magnitude of 
this correction as an estimate of the m s mismatch error 
for both SU(3) and SU(2) fits. 

The analysis described in this subsection does not 
make optimal use of all our data. For example, ensemble 
F2 is only used to check the (lack of) ami dependence. 
By doing a combined continuum and ami fit we could 
likely do better. We have not attempted such a fit, how- 
ever, since this source of error is subdominant. Concern- 
ing the am s mismatch, we plan in the near future to do 



an exact interpolation to the physical strange sea quark 
mass using the reweighting technique |47j |. 

G. Continuum extrapolation 

At this stage of the analysis we have results for Bk on 
three lattice spacings. As noted above, we take these to 
be the results from the C3, Fl and SI ensembles. Run- 
ning to the common scale fi = 2 GeV yields, for various 
SU(3) and SU(2) fits, the results in Tables IXXIVI and 
IXXVI respectively. Most of these results have appeared 
in earlier Tables, but we also include results for the SU(2) 
5X fits for completeness. The dominant errors remaining 
at this stage are those due to taste-conserving discretiza- 
tion and truncation errors. These depend on the lattice 
spacing as a 2 a n , where n — 0,1,.... (n=0 is allowed 
since we do not use Symanzik-improved operators), and 
as a 2 . Since we cannot disentangle these effects with a 
fit to three points, we choose to fit to a linear function 
of a 2 , and estimate the error we make by dropping other 
dependences. The resulting fits are shown in Figs. |2"T1 and 

cm 



TABLE XXIV. B K (n = 2GeV) from various SU(3) fits. 



a (fm) 


N-U 


N-Bl 


N-B2 


N-BB1 


N-BB2 


0.12 
0.09 
0.06 


0.596(10) 
0.564(20) 
0.581(12) 


0.560(5) 

0.533(10) 

0.533(7) 


0.585(8) 
0.532(8) 
0.554(5) 


0.562(6) 
0.535(9) 
0.535(6) 


0.592(14) 
0.539(12) 
0.560(11) 



TABLE XXV. 5^(2 GeV) from various SU(2) fits. 



a (fm) 


4X3Y-NLO 


4X3Y-NNLO 


5X3Y-NLO 


5X3Y-NNLO 


0.12 


0.560(3) 


0.565(5) 


0.558(3) 


0.563(4) 


0.09 


0.542(8) 


0.544(11) 


0.541(7) 


0.543(10) 


0.06 


0.538(5) 


0.538(7) 


0.539(4) 


0.538(6) 



TABLE XXVI. B K (/j, = 2GeV,a = 0) from various fits. 



Group fit B K (n = 2GeV,a = 0) 

SU(3) N-BB1 0.524(7) 

SU(3) N-BB2 0.539(14) 

SU(2) 4X3Y-NLO 0.531(6) 

SU(2) 4X3Y-NNLO 0.529(9) 



For the SU(3) analysis, we take the continuum value 
obtained with N-BB1 fits as our central value. The differ- 
ence between the N-BB1 and N-BB2 fits will be used as 
a fitting systematic, as discussed in the next subsection. 
To account for the fact that the discretization errors can 
depend on a 2 times a to some power, rather than just 
a 2 , we take the difference between the results on the SI 
ensemble and in the continuum as the systematic error in 
the extrapolation. This is reasonable because including 
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FIG. 27. -Br; (2 GeV) versus a 2 , together with a linear extrap- 
olation to the continuum limit. The data is obtained using 
SU(3) N-BB1 fits. 
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FIG. 28. Bk(2 GeV) versus a 2 , together with a linear extrap- 
olation to the continuum limit. The data is obtained using 
SU(2) 4X3Y-NNLO fits. 



powers of a moves the SI ensemble closer to the con- 
tinuum limit relative to the other two ensembles. It is 
conservative because our data strongly indicates that the 
actual continuum value lies below that obtained on en- 
semble SI. 

We follow the same strategy for the SU(2) analysis, 
using the 4X3Y-NNLO fits for our central value. 



Corrections proportional to a 2 , arising from truncation 
of the matching factors, are estimated separately, as we 
now discuss. 



H. Error budget 

In this subsection we estimate the remaining system- 
atic errors and combine them with those obtained above 
to give the complete error budget. 

We begin by estimating the impact of truncating 
matching factors. We assume that the dominant miss- 
ing term in the perturbative expansion is of magnitude 
1 x a s (l/a) 2 , and thus take as our error estimate 



Ai4 2) 



Here B^ is the one-loop corrected result, and a s is the 
MS coupling constant. The resulting error estimates for 
the SU(3) analysis are given in Table IXX VIII The results 



sW( M =l/a)x a,(jji=l/a) 



(76) 



TABLE XXVII. Estimates of the truncation error in the 
SU(3) analysis. We use the results of the N-BB1 fits, and 
ensembles C3, Fl and SI. 



a (fm) 




Us 




0.12 


0.5728(57) 


0.3286 


0.062 


0.09 


0.5271(88) 


0.2730 


0.039 


0.06 


0.5123(56) 


0.2337 


0.028 



for the SU(2) analysis are almost identical. 

The continuum extrapolation discussed above will ex- 
trapolate away some of this error, so that the truncation 
error in the continuum result will be smaller than that 
on the superfine ensemble. To be conservative, however, 
we use the size of AB K on the superfine ensemble as an 
estimate of this error. It is appropriate to be particularly 
conservative for the truncation error, since the choice of 
unity multiplying a 2 is a guess. We note, however, that 
the same approach applied to the tree-level data (i.e. us- 
leads to an overestimate of the shift 



ing AB$ 



B$>a) 



between B$ and B^ ] 0. 

A further source of error is due to our use of a finite 
volume (FV). This error can be approximately accounted 
for by fitting to the finite volume form of the NLO SChPT 
result (discussed in Appendix lAl . We have not yet im- 
plemented this approach, however, as it is very compu- 
tationally demanding. We have used SChPT to make 
preliminary estimates of the size of the corrections that 
might be induced [l6j]. Considering the C3 ensemble, we 
found that the FV error predicted by SU(3) SChPT was 
~ 1% after extrapolation to physical masses. For SU(2) 
SChPT the error was substantially smaller. Since these 
two results disagree, while the actual FV error has a def- 
inite (though unknown) size, we concluded that SChPT 
at NLO is not a reliable quantitative guide to the size of 
FV effects. 
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Here we adopt a more direct approach. We determine 
the impact of changing the volume by comparing the re- 
sults on ensembles C3 and C3-2. These differ only in their 
spatial volumes, which are 20 3 and 28 3 , respectively. The 
statistical weights of the two lattices are almost equal be- 
cause 



20 3 x 9 x 671 1 nnA 

-Rstat = 5 = 1.004 . 

stat 28 3 x 8 x 274 



(77) 



A comparison of results from these two lattices (after 
extrapolation to physical quark masses) is given in Ta- 
ble IXXVUIl We take the difference in Bk between C3 
and C3-2 ensembles as our estimate of the finite volume 
error. By doing so, we are effectively assuming that the 
28 3 lattices have negligible FV effects, which is what is 
found using NLO SChPT. 



TABLE XXVIII. Volume dependence of B K . We show 
B K (fi = 2 GeV) from N-BB1 fits for SU(3), and from 4X3Y- 
NNLO fits for SU(2). 



analysis 


20 3 (C3) 


28 3 (C3-2) 


SU(2) 


0.5651(46) 


0.5699(45) 


SU(3) 


0.5623(56) 


0.5754(56) 



The final source of significant error is our choice for the 
decay constant that appears (as 1//^) in the coeffi- 
cient of the chiral logarithms. As should be clear from the 
description of our fits, our data are not sensitive enough 
to the contributions of chiral logarithms to allow a fit to 
/ w . Thus we must fix it. We choose = 132 MeV for 
our central value, which is a somewhat outdated approx- 
imation to the physical value, = 130.4 MeV 0. (We 
have checked that results for Bk are indeed insensitive 
to the small difference between these two values.) It is 
reasonable to use the physical pion decay constant since 
the loops which give the dominant curvature are those 
involving pions. It is, however, equally valid at NLO 
to use the decay constant in the chiral limit, which is 
« 108 MeV for SU(3) ChPT and ps 124.2 MeV for SU(2) 
ChPT [Hi In the SU(3) fits, it is also equally valid at 
NLO to use the physical value of the kaon decay con- 
stant, f K fa 156.5 MeV QJ]. 

It is straightforward to determine an "/„. error" for our 
SU(2) fits — we quote the difference between the results 
obtained with 132 MeV and 124.2 MeV. For the SU(3) 
fits, we have concluded after some experiments that the 
fn error is already reasonably accounted for by the error 
we include to account for the dependence on the Bayesian 
fitting scheme ["fitting (2)" below]. Since we do not use 
the SU(3) fits for our central value, we have not pursued 
this further. 

We now collect all sources of error and present the error 
budget. We do this separately for the SU(3) and SU(2) 
analyses. The SU(3) budget is given in Table IXXIXI 
Most of these errors have been discussed either in this 
subsection or earlier ones. The exceptions are the two 



"fitting" errors and the n error. The first fitting error 
estimates the uncertainty due to the possibility of im- 
plementing the Bayesian method in different ways. The 
N-Bl fit uses one level of Bayesian constraints while the 
N-BB1 fit uses two levels. We prefer the latter method, 
for reasons explained earlier, but to be conservative we 
take the difference between the results of the two fits as a 
systematic error. The second fitting error estimates the 
impact of making different assumptions about the prior 
information used in the Bayesian fits. The N-BB1 fit as- 
sumes that the dominant lattice artifacts are of 0(a 2 ) 
while the N-BB2 fit assumes that they are of 0{a 2 s ). We 
take the difference between these fits as an estimate of 
this error. The resulting rather large error roughly en- 
compasses the range of results obtained with different 
SU(3) fits, as can be seen from Table IXXIVI Thus it can 
be interpreted more generally as an estimate of the error 
due to uncertainties in the parameters of the fit function 
corresponding to lattice artifacts. 

TABLE XXIX. Error budget (in percent) for B K obtained 
using SU(3) fitting. 



cause 


error (%) 


memo 


statistics 


1.4 


N-BB1 fit 


matching factor 
discretization 


5.5 
2.2 


AB^ (SI) 

diff. of (SI) and (a=0) 


fitting (1) 


0.36 


diff. of N-BB1 and N-Bl (C3) 


fitting (2) 


5.3 


diff. of N-BB1 and N-BB2 (C3) 


ami extrap 


1.0 


diff. of (C3) and linear extrap 


am B extrap 


0.5 


constant vs. linear extrap 


finite volume 


2.3 


diff. of 20 3 (C3) and 28 3 (C3-2) 


scale n 


0.12 


uncertainty in n 



Finally, we have propagated the uncertainty in ri, 
which sets the scale, through the analysis. We do so 
by repeating the analysis using n ± er ri to determine the 
lattice spacing. 

Our final result from the SU(3) analysis is 

B K (NDR,/i= 2 GeV) = 0.5237±0.0074±0.0438 , (78) 

where the first error is statistical while the second is the 
sum of the systematic errors in quadrature. The former 
error has magnitude 1.4% and the latter 8.4%. 

The error budget for the SU(2) analysis is presented in 
Table IXXXI The errors are as for the SU(3) analysis ex- 
cept for the two fitting errors and the / ff error. The first 
of these estimates the impact on the fit of the addition 
of higher-order terms in the SChPT fit function used for 
the X-fits. Specifically, we use the difference between the 
NLO and NNLO fits on the 4X3Y data set. The second 
fitting error results from the uncertainty in the Y-fit: we 
quote the difference between linear and quadratic extrap- 
olations to the physical point Yp = 2m 2 K — m%. 

Our final result of Bk from the SU(2) analysis is 



B K (NBR,fx= 2 GeV) = 0.5290±0.0090±0.0316 , (79) 
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TABLE XXX. Error budget of B K in the SU(2) fitting. 



cause 


error (%) 


memo 


statistics 


1.7 


4X3Y NNLO fit 


matching factor 


5.5 


AB^ (SI) 


discretization 


1.8 


diff. of (SI) and (a=0) 


ntting yi) 


n go 


A- lit {{-"J) 


fitting (2) 


0.08 


Y-fit (C3) 


ami extrap 


0.48 


diff. of (C3) and linear extrap 


arris extrap 


0.5 


constant vs. linear extrap 


finite volume 


0.85 


diff. of 20 3 (C3) and 28 3 (C3-2) 


ri 


0.14 


n error propagation 




0.38 


132 MeV vs. 124.4 MeV 



with the statistical error having magnitude 1.7% and 
the total systematic error 5.9%. Although the statis- 
tical error in this result is somewhat larger than that 
from the SU(3) analysis — a reflection of the smaller data 
set used in the chiral extrapolation — the systematic er- 
ror is considerably smaller, as is the total error. This 
improvement results from the simplifications which oc- 
cur in SU(2) SChPT, which allow more straightforward 
fitting. Because of this we use the SU(2) analysis for our 
final result, while the SU(3) result provides a consistency 
check. 



IV. CONCLUSION 

We have presented a calculation of Bk using im- 
proved staggered fermions with 2 + 1 flavors of dynamical 
fermions. We have used three different lattice spacings 
and multiple choices for the valence and sea quark masses 
in order to perform the extrapolations to the continuum 
limit and the physical quark masses. A key ingredient 
in our analyses are the fitting forms predicted by stag- 
gered chiral perturbation theory, for they allows us to 
determine, and remove, the impact of taste-breaking dis- 
cretization errors. To carry out our analysis, we needed 
to generalize previous SU(3) SChPT results for Br by 
including the effects of a mixed action (HYP-smeared va- 
lence quarks on asqtad sea) and by extending the results 
to SU(2) SChPT. 

A striking feature of the SU(2) SChPT result is that 
taste-violating discretization and truncation errors do 
not enter until NNLO. This means that fitting to the 
NLO form (as we do) is as simple as for fermions with 
exact chiral symmetry. Although we can use only a quar- 
ter of our complete data set when doing the SU(2) fits, 
this is more than compensated for by the simplicity of 
fitting. 

Using these ChPT results, we have performed two in- 
dependent analyses based, respectively, on SU(3) and 
SU(2) SChPT at NLO. The former requires some ad- 
hoc simplifications of the fitting expression and the use 
of Bayesian constraints to obtain stable fits. We find sig- 
nificant sensitivity to the prior information used in the 



Bayesian procedure, as well as to the precise implemen- 
tation. The SU(2) ntting does not require the use of 
Bayesian priors, and leads to significantly smaller overall 
errors. Hence, we use the SU(2) analysis for our final 
result (f79|). while the SU(3) result, ([78]). provides a con- 
sistency check. Converting to the renormalization group 
invariant Bk , we find 

B K = 0.7243 ± 0.0123 ± 0.0433 = 0.724 ± 0.045 . (80) 

Our result is in good agreement with the two existing 
results which use 2+1 flavors of dynamical fermions and 
control all sources of error. Aubin, Laiho and Van de Wa- 
ter use a mixed action with valence domain wall fermions 
on the MILC coarse and fine lattices and find [3| 

B K = 0.724 ± 0.008 ± 0.028 = 0.724 ± 0.029 . , (81) 

The RBC-UKQCD collaboration use both domain wall 
valence and sea quarks, finding 21] 

B K = 0.720 ± 0.013 ± 0.037 = 0.724 ± 0.039 (82) 

on a single lattice spacing of 1.16 fm, with a preliminary 
result (without an estimate of finite volume effects) 

B K = 0.738 + 0.026 (83) 

based on two lattice spacings (a = 0.86 and 1.16 fm) 
and updated fitting [5j. Our result is also consistent 
with the previous calculation using staggered fermions 
on the coarse (a « 0.12 fm) MILC lattices, which finds 
B K = 0.83 ± 0.02 ± 0.18 [8]. We view it as a significant 
success of lattice QCD that results with valence staggered 
and domain- wall fermions are consistent. Although our 
calculation shares lattices with that of Ref . 0] , it is com- 
pletely independent of that of Refs. [5l [2lj. 

Compared to the other two calculations, our result has 
the advantages of pushing closer to the continuum limit, 
and being based on three, rather than two lattice spac- 
ings. It also is significantly computationally cheaper. 
A clear disadvantage is that we must deal with taste- 
breaking, although this is much less of an issue with 
SU(2) ntting than with SU(3). Even with SU(2) ntting, 
however, we do rely on ChPT to a greater extent than 
the other calculations. 

Our total error (6%) is significantly larger than that 
from the other calculations — 4.1% for Ref. [3j and 3.5% 
for Ref. Q. This difference is due almost entirely to 
our use of perturbative (one-loop) matching rather than 
the non-perturbative renormalization (NPR) used by the 
other calculations. Quantitatively, our matching error is 
5.5% to be compared to 3.3% Q and 2.4% @. In addi- 
tion, our error estimate is less reliable since it is based 
on an educated guess about the possible size of two-loop 
terms. 

It is also of interest to compare our result to those 
obtained using two flavors of dynamical quarks (but a 
quenched strange quark) . This may be a useful compari- 
son because there are indications that the dependence of 
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dimensionless quantities on the mass of the strange sea 
quark is weak. Indeed, our results for Bx show weak 
dependence on all sea-quark masses, at least within the 
range of masses that we consider. In fact, our result is 
consistent with those obtained from Nf — 2 calculations. 
A recent calculation using twisted-mass fermions, and 
with all errors controlled except for that due to quenching 
the strange quark, finds B K = 0.729 ± 0.025 ± 0.016 [49j . 
The result obtained with dynamical overlap fermions, us- 
ing a single lattice spacing, is — 0.758 ± 0.006 ± 
0.071 [50]. With domain- wall fermions on a single lattice 
spacing, the result is B K = 0.699 ± 0.025 Finally, 
with improved (untwisted) Wilson fermions, the best re- 
sult is B K = 0.69 ±0.18 

Looking into the future, our result would be compet- 
itive with those of the other Nf =2 + 1 calculations if 
we could reduce our matching error down to the the level 
achieved in these other calculations. We are following 
two approaches to achieve this reduction: using 2-loop 
perturbation theory (which would reduce our error esti- 
mate to 1.3%, although this would remain an educated 
guess) and using NPR to determine the matching fac- 
tors (which we expect to lead to similar errors to those 
quoted by the other calculations.) First results with NPR 
using improved staggered fermions and studying bilinear 
matching factors are encouraging (53|. 

Other improvements that are underway are the use of 
a fourth lattice spacing (the MILC "ultrafine" lattices 
with spacing a ~ 0.045 fm), SU(2) fitting including the 
finite-volume effects predicted by NLO SChPT, simulta- 
neous continuum and chiral fitting, the addition of en- 
sembles with other values of the strange quark mass, and 
an improvement in statistics for all ensembles. We are 
also calculating matrix elements of operators with other 
Dirac structures, since these are needed to constrain some 
theories of physics beyond the standard model. 
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Appendix A: Functions appearing in SU(3) fitting 

In this appendix we present explicit forms for the func- 
tions that appear in the NLO SU(3) SChPT result for B K 
with a mixed action. 



1- -MSonn and M° disc 

These two functions are unaffected by the use of a 
mixed action, and thus can be obtained from the general 
results of Ref . [IH . Here we give fully explicit results for 
2 + 1 flavors. First we have 



A^on^E^ (Al) 
B 

-\[{G + X B )£(X B ) + (G + Yb)£(Y b ) 
-2(G - K B )£{K B ) - 2GKb£(Kb)} , (A2) 



p(3) _ 

-^B ~ 



where the coefficients give the relative weight of the dif- 
ferent tastes in the loop 



^=1/16, 
r A = 4/16. 



r p = l/16, 
t t = 6/16, 



4/16, 



(A3) 



which arc simply proportional to their multiplicity. Two 
standard chiral logarithmic functions arise: 



£(X)=X[log(X/ f i 2 DR ) + S F 1 Y (X)] , 
= -log(A/^ R )-l + <5 3 FV (A), 



(A4) 



(A5) 



where /*dr is the scale introduced by dimensional regu- 
larization. The [j>dr dependence of the chiral logarithms 
is canceled by a corresponding dependence of the LECs. 
When we evaluate £(X) and £(X) numerically we set 
/Udr = 0.77 GcV. Although we do not use them in this 
paper, we quote for completeness the finite volume cor- 
rections to the chiral logarithms: by 

^(M^E*- (A6) 



J3 '(M*)=2J2K (\H\ML).. 

n#0 



(A7) 



with L the box size, and n is a vector labeling the image 
position. These formulae assume that the time direc- 
tion is much longer than the spatial directions — but this 
is easily corrected by including images in the time di- 
rection as well. The image sums converge fairly quickly 
because the Bessel functions die off exponentially. For 
further discussion of the properties of these functions see 
Refs. [54] and [H. 

The second function is given by 
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QM disc = e(m)(G + m) {Xi _ m)2[Yi _ m? 



(Li — X{){Si — X\) 

(m - Yi) 



£{X{)(G + X,) - £{X{) + t(Xi)(G + Xi) 
£{Y{){G + Yi) - t(Yi) + i(yi)(G + Yi) 



1 



1 



L i -X 1 Sj-Xj rn-Xt Y I -X 1 
1 1 2 



L l -Y l S l -Y 1 m -Y 1 Xj-Yj 



(A8) 



As is clear from this form, this is the taste singlet con- 
tribution to the "disconnected" matrix element, i.e. that 
involving hairpin vertices in the loop. The poles in this 
function are misleading: due to cancellations between 
terms, the function vanishes when Xj — > Yj and is finite 
when Xj or Yi equal L], Si or rji. These cancellations 
could lead to numerical instability in the evaluation of 
A'ldisc, but we have not found this to be the case. Finally, 
we note that A'l° isc is independent of the renormalization 
scale /j D r. 



the operator enumeration used in Ref. [Tlj]. The coeffi- 



cients of the functions are 



^ 64tt 2 / 4 [ x 

B' J 



where B' runs over all five tastes, and B=I, P, V, A and 
T, for j = 6, 7, 8, 9 and 10, respectively. The matrices 
g and h are defined in Ref. [18(, and / is the chiral limit 
value of the pion decay constant. The coefficient of 
is 



2. Definitions of , F (5) and F (6) 

These three functions describe the contributions to 
■Mconn that result from discretization and truncation er- 
rors. Their origin is explained in detail in Ref. [l8j. Their 
forms are 



b n = - 



,(4) 



F (6) = 



IG 



2G £(K B ) 

+ (£(X B )+£(Y B )-2£(K B ))} (A9) 
[(£(X V )+£(Y V )-2£(K V )) 

+ (£(X A ) + £(Y A ) - 2£(K A ))} (A10) 
[£{X T ) + £(Y T ) - 2£(K T )} (All) 



Note that F^ and F^ vanish when m x = m y . 

For completeness, we give the relationships between 
the coefficients of these functions and those appearing in 



while that of F^ is 



&12 = - 



(Cf + 6Cf), (A13) 



■ (AW) 



We can use these formulae to estimate the order of 
magnitude of the coefficients &6-12- One expects that 
C x ~ a 2 Aq CD and/or ~ q; 2 Aq CD , while the numerical 

factors vary from 5 b ' b /(64tt 2 ) - 1/10 to 48/tt 2 - 5. We 
simply treat them as of O(l), and / ~ Aqcd, leading to 
the crude estimate given in Eq. (|35|) . 



Definitions of fS} ] and F< 2) 



These functions give the form of the contributions to 
A^disc that arise from discretization and truncation er- 
rors. The explicit 2 + 1 flavor form of ffp can be deduced 
from the general result given in Ref. [18| : 
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F 



(i) _ 



8/|G 



(Y B -X B ) 2 (L B - VB )(S B -r, B ) 
' {X B ~7 1 B) 2 {Y B -m) 2 {v' B -VB) 



(Y B -X B ) 2 (L B - VB )(S B -r, B ) 



(L B — X B )(S B — X B ) 
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Lb-Yb 
1 



S B -Y B 
1 



X B - Y B Vb -Y b v' b - Y b 



(A15) 



The coefficients of this function are 



^2j4 ( 2C x B 



Cl B ) a 2 S™ A1 



(A16) 



with B=V and A for j = 13 and 14, respectively. 

(2) 

The function F B occurs only in the mixed-action set- 
up. Its form is 



F 



(2) 



mo 



, £(Y B )-£(X B ) 
Yb-Xb 

+ e(x B ) + I(Y B ) 



with the coefficients being 



1 



7T 2 /< 



(2G 



2B 



Gf ) a 2 <5™ A2 : 
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with B=V and A for j = 15 and 16, respectively. 

The functions F B ' are finite — the apparent poles 
have vanishing residues — and vanish when m x = m y . 
They are also independent of the renormalization scale. 

The size of the coefficients 613-16 can be estimated as 
for be-12, with the result given in Eq. (|4T)|) . 



Appendix B: Functions appearing in SU(2) SChPT 
fitting 

In this appendix we describe how SU(2) SChPT results 
are obtained from SU(3) SChPT results using the recipe 
outlined in Sec. Ill CI and justified in Appendix [Cj The 
recipe is to first expand the SU(3) result in powers of 
m x /m s and me/m s (where m s is here meant generically, 
and thus includes m y ), keeping only terms of the desired 
order (here NLO) in SU(2) power-counting. Secondly, 
one allows the LECs which are constants in SU(3) SChPT 
to become arbitrary functions of m s and m y , with the 
exception of /. 



First consider the analytic terms H2—H5 of Eqs. — 
(|2T|) . H2 is proportional to (m x + TOj,)/Aqcd- The 
TOz/Aqcd part remains in SU(2) ChPT, while the 
m y /AQCD part is absorbed into the m s dependence of 
the overall constant Bq. Similarly, H3 oc (m x + m y ) 2 
leads to an analytic term proportional to m 2 , to an m s 
dependence of the analytic term linear in m x , and to an 
m 2 dependence of Bq. H4 oc (m x — m y ) 2 / (m x +m y ) leads 
to analytic terms of all orders in m x , with the expansion 
parameter being m x /m y rather than to^/Aqcd ■ Finally, 
oc (2rri£ + m s ) leads to an analytic term linear in m,£ 
and further m s dependence of Bq. The net result is that 
given in Eqs. (|4HI45[) . with the coefficients dj having an 
arbitrary dependence on the strange quark mass(es). 

Next consider the continuum-like chiral logarithms in 
Hi [see Eq. (|2"2"j) ] . Here a key role is played by the overall 
multiplier 1/G = 1/w 2 p, which arises from the denom- 
inator in the definition of Bk ■ In order for a term from 
M° conn or M° disc to remain of NLO in the SU(2) limit, it 
must come with a compensating factor of G, m y or m s . 
If not, it will become of NNLO in SU(2) SChPT. 

To see how this works in more detail, consider first 
A'lconn, given in Eq. (|A1[) . Its contribution to Hi consists 
of a sum over tastes B of the expression 
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8tt 2 / 2 G 2 8tt 2 / 2 
1 + §Vb) 
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(Bl) 



2 I ) t(K B ) - 2K B l(K B ) 



On the first line, the term proportional to £(Xg) survives 
as a chiral log in the SU(2) limit, but its coefficient sim- 
plifies to 1. This is because Xb/G oc m x /m s (where 
"mj" can include taste-breaking 0(a 2 ) corrections) is 
suppressed in the SU(2) limit, so that, given the overall 
factor of 1/(8tt 2 / 2 ), the X B /G term is of NNLO in SU(2) 
power counting. On the second line, no terms give rise 
to SU(2) chiral logarithms, because the argument of the 
logarithms is finite when either m x or mi vanishes. Thus 
one can expand the logarithms about m x — 0, leading to 
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a power series in m x , in which subsequent terms are sup- 
pressed by m x /m y or m x /A oc Xp/A 2 ,, with A = Aqcd- 
The same is true for the factors multiplying the loga- 
rithms. For example, the last term on the second line 
becomes 



K B e(K B ) K B log[K B ] 

oc 



(m x +m y 



A 2 
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-a 2 A 3 ) 



A 



log [m x + m y + a 2 A 2 ] 



cx fi(m y /A) + / 2 (m„/A)- 



A 
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The analytic dependence on m y in fx can then be ab- 
sorbed into the (unknown) m y dependence of the over- 
all coefficient multiplying Hi [which becomes d\ in the 
SU(2) SChPT result, Eq. gT}]. The dependence on m x , 
which is of NLO, can be absorbed into the coefficient of 
the analytic term Q3. In this way, one finds that the en- 
tire second line can be absorbed into the coefficients d\ 
and di- The overall result is that 



the greatly simplified form 
3 



G 



(L t - Xj)l{Xi) + 1{X!) 



+analytic + NNLO . (B7) 

All the remaining functions which arise in SU(3) 
SChPT, H e -H 16 , become of NNLO in the SU(2) limit. 
This is because all the functions which appear, 

F (2) _ 

F( 6 \ contain a factor of 1/G from the definition of Bk, 
but no counterbalancing factor in the numerator when 
the numerator contains a logarithm of Xb- For example, 
in i<g [see Eq. (IA9[) ] part of the numerator contains a 
factor of G, but this multiplies hi Kb, which gives only 
terms analytic in m x . The part containing In Xb is mul- 
tiplied by Xb, leading to the additional suppression by 
Xb/G — m x /m s . 

The final result, given in Eq. (|4"Tj) , therefore contains 
no LECs induced by discretization or truncation errors. 
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Appendix C: Derivation of recipe for obtaining the 
SU(2) SChPT result 



Turning next to the contribution of [J isc , Eq. 
we find two new features. First, there are logarithms 
(and other functions) of rji . But since r\i = L j/3 + 3Sj /3 
does not vanish when mg and a 2 vanish, these functions 
can be expanded just as logarithms of Yr and Kb were 
above. Second, we need to use the resultFl 



(S7 - A/) su(2) 3 



(B6) 



Third, one can have contributions proportional to 
(m e /A)f(m s /A) (e.g. from the Lj.{Yi)G term) which 
can be absorbed into the coefficient dr. 

Given these results, combined with the arguments used 
above for A4° onn , we find that the only terms in M® isc 
which cannot be absorbed into the coefficients d\ , g?2 and 
^4, are those multiplying In Xj. Of these, we must pick 
out those multiplied by G, since all others are suppressed 
by m x /m s and thus become of NNLO. The result is then 



17 One might be concerned about the legitimacy of the SU(2) limit, 
since ChPT does not control the m 3 dependence when m s be- 
comes large. If one looks back at the source of this ratio in the 
derivation, however, one finds that taking the m B — > 00 limit in 
this way amounts to projecting onto the generators of the ap- 
propriate partially quenched SU(2) subgroup. This is exactly 
analogous to the way in which one projects against the "super- 
7/" by sending rrig — > 00 in a PQ calculation |55fl . In other words, 
it is a mathematical trick and does not imply control over the 
actual m B dependence. 



In this Appendix we demonstrate the validity of the 
recipe used to obtain the SU(2) SCh PT result. This 
recipe has been described in Sec. Ill CI and implemented 
in Appendix [Bj The essential claim is that it is sufficient 
to take the SU(2) limit of the NLO SU(3) expression, 
as long as one allows LECs (except for /) to have an 
arbitrary dependence on m s and m y . This arbitrary de- 
pendence is the only impact of working to all orders in 
r s = to s /AqcdH I n other words, higher orders in SU(3) 
SChPT do not lead to new types of functional depen- 



dence on the small quantities Ap/A 2 , Lp/A 2 ,, a 2 A 2 
and a 2 . 

In particular, looking at the final result of Eqs. (|41ti45|) . 
there are two main features in need of justification: 

1. There are no terms involving those LECs that are 
proportional to a 2 or a 2 . 

2. The chiral logarithm in the function Qi [Eq. (|42l) ] 
is fully predicted. In particular, its coefficient (cx 
l// 2 ) has no dependence on m s . 

Discretization and truncation errors thus enter only 
through the coefficient d\ [see Eq. (141)1) ] and through the 
masses of the PGBs in the chiral logarithms. 

The approach we use to justify the recipe is to use 
SU(3) ChPT, working to all orders in r s , but only at NLO 
in light-quark masses, discretization and truncation ef- 
fects. This amounts to a "poor-person's SU(2) SChPT", 



1 QCD 



18 Here and throughout this Appendix we use m s to represent both 
m s and m v . 
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and allows us to piggy-back on the SU(3) SChPT analysis 
of Ref. [18J . We assume that this all-orders perturbative 
analysis captures all possible forms of dependence on the 
small quantities. 

The form of the analytic terms in the SU(2) expression, 
Eqs. (|43l) -l45 |) . follows immediately in this approach. In- 
deed, since the SU(2) limit of the NLO SU(3) expression 
already gives the most general NLO analytic dependence 
on Xp/A 2 , Lp/A 2 , a 2 Aq CD and a 2 , higher order terms 
can only generate r s dependence of their coefficients. 

We now turn to the first feature noted above. We start 
by displaying the form of the contributions proportional 
to the discretization/truncation LECs that arise in the 
SU(2) limit of the NLO SU(3) expression. Keeping only 
the parts which are non-analytic in light-quark quanti- 
ties, one finds that they all have the form 



SB K 



iSC/(3)@NLO 
Idisc/trunc 



2 M 2 log(M 2 ) 
* G 



(CI) 



We use here a compact notation: a 2 stands for discretiza- 
tion and truncation effects proportional to a 2 , a /ir or 
a 2 Aq CD ; M 2 stands for any light- meson mass-squared 
[Xb, ?7b (except r]i) and Lb] and also (except in the argu- 
ment of the logarithm) for the hairpin vertices a 2 8 



2 cMAl,2 



B 



In addition, the scale /iDR in the logarithm is kept im- 
plicit. The form of ()Cip arises as follows: the a 2 comes 
from the LECs, the M 2 \og(M 2 ) from the pion loop, and 
the l/G from the denominator in the definition of Bk- 
All other factors of Aqcd cancel, as required by dimen- 
sional analysis. As noted in Appendix [Bl the contribu- 
tion of Eq. ([Cl]) is of NNLO in SU(2) power-counting, 
because it is suppressed both by a 2 and by M 2 /G. 

The issue then is whether this double suppression con- 
tinues to hold when one takes the SU(2) limit of higher 
order SU(3) SChPT expressions. If (|C1I) simply gets mul- 
tiplied by a function of r s , then it remains of NNLO in 
SU(2) ChPT. If, however, some higher order terms re- 
duce in the SU(2) limit to 



SB 



K 



unwanted 



r s F{r s )a 2 log(M 2 ) 



(C2) 



then they would contribute at NLO. Here F(r s ) is an 
unknown function, which we assume to be of O(l) in 
SU(2) power-counting for r s ~ 1. The overall factor of 
r s is included because we know from Eq. (|C1[) that there 
is no such term when r s = 0. 

That (|C2|) cannot appear follows from two observa- 
tions. First, higher orders in SU(3) ChPT cannot intro- 
duce an enhancement by G/M 2 cx m s /me. Second, chiral 
logarithms always have the form M 2 ln(Af 2 ), rather than 
simply ln(Af 2 ). Since the factor of a 2 must be present 
as we are considering LECs related to discretization or 
truncation errors, and the l/G arises from the definition 
of Bk OH one necessarily obtains terms of the form (|Clj) 



tion proportional to r B 



• Cr/A?. . This makes no difference to our 



rather than (|C2|) . 

The two observations of the previous paragraph follow 
from ChPT power counting, specifically the generaliza- 
tion of staggered ChPT power counting applicable to a 
calculation of Bk [l8j . This power-counting ensures that 
successive orders in SU(3) SChPT are "suppressed" by 
factors of ra/AQCD or a 2 (using the latter generically). 
Thus it is not possible that by going to higher order one 
can enhance the result by a factor of m s /mi — the best 
that one can do is have a factor of r s ~ 0(1), which does 
not change the order in SU(2) ChPT. 

As for the pion loop contribution, this can appear as 
part of an arbitrary-order SU(3) diagram in which all 
the other particles are "heavy" (i.e. with mass-squared 
proportional to m s ). In the SU(2) limit this diagram 
collapses to give a pion tadpole. The contribution of the 
part of the diagram without the tadpole can be no larger 
than 5Bk ~ a 2 F(r s ), where the a 2 comes from the LEC, 
and SU(3) power-counting ensures that F(r s ) ~ O(l). 
Adding back in the pion loop, and noting that each pion 
field comes explicitly with a factor of 1//, one sees that 
the loop must give the generic form 

Af 2 log(Af 2 ) M 2 \og(M 2 ) M 2 \og(M 2 ) 



(4tt/) 2 



A 2 

x 



G 



where in the last step we are using the fact that G ~ 
A x in SU(2) power counting. In more detail, the loop 
can involve either a flavor non-singlet pion propagator, 
in which case M 2 = Xb, or the flavor singlet propagator 
[e.g. of the form in Eq. flTTl) for B = V, A], in which case 
the M 2 inside the logarithm can be X-q or t^b, while the 
M 2 outside can also be a hairpin vertex proportional to 
a 2 . In all cases one ends up with an expression of the 
NNLO form EU). 



arguments since we treat G ~ in the SU(2) limit. 



We now turn to the second feature of the result Eq. (|4lj) 
that needs to be explained, namely that the SU(2) chiral 
logarithm has a predicted coefficient, despite working to 
all orders in r s . In the continuum, this follows from an 
SU(2) ChPT analysis in which the kaon is treated as 
heavy and serves, along with the four-quark operator, as 
a source for pions 21]. Using the equations of motion, 
it is shown in Ref. [2l| that all operators contributing to 
Bk at one-loop order in SU(2) ChPT (which can have 
any even number of derivatives acting on the external 
kaons) collapse to a single form. Since the coupling to 
pions is determined by SU(2) chiral symmetry, both the 
form of the chiral logarithm and its coefficient relative to 
LO are predicted. The result found in this way is that 
obtained from Eq. (|4lT) by setting taste-breaking in the 
pion masses to zero. 

Our aim in the following is to show that continuum ar- 
gument can be generalized to staggered ChPT with only 
minor changes. Since we have already dealt with the 
contribution from LECs arising from discretization and 
truncation errors in the operator, the remaining such er- 
rors are those coming from 0(a 2 ) taste-breaking effects 



3G 



arising from the action. These enter through the taste- 
breaking potential, V, whose explicit form is given in 
Refs. [11 0|. Thus we must consider SU(3) SChPT di- 
agrams with a single insertion of V and any number of 
powers of r s . 

The potential V leads to two-point, four-point and 
higher-order vertices, all multiplied by an explicit factor 
of a 2 . Those involving four-point or higher-order vertices 
give contributions to Bk of the form (|C1|) , which are thus 
of NNLO in the SU(2) limit. The argument for this is 
exactly as given above for the discretization/truncation 
LECs. Thus we need only account for the two-point 
0(a 2 ) vertices, which only affect propagators. These will 
only have an impact on the SU(2) chiral logarithm if 
these vertices are included in the "light" meson propa- 
gators. Including them on "heavy" meson propagators 
will lead only to corrections of the NNLO form (jCll) . 
The upshot is that we should work to all orders in r s in 
continuum SU(3) ChPT, keeping 0(a 2 ) corrections only 
on the single light meson propagator needed to develop 
the non-analyticity. But working to all orders in r s in 
SU(3) ChPT can be achieved by using continuum SU(2) 
ChPT — indeed, the latter goes beyond perturbation the- 
ory. Thus we end up with the following recipe: use the 
continuum SU(2) ChPT calculation of Ref. [2l[ to deter- 
mine the Feynman diagrams and their overall factors, but 
evaluate the integrals using the light-meson propagators 
including the 0{a 2 ) corrections. 

In fact, we must generalize the calculation of Ref. [2l[ 
to include the extra taste degree of freedom. This is 
straightforward, and follows the methodology used in 
Ref. [H]. We find that, just as in SU(3) ChPT, adding 
taste results in the LO chiral operator having two terms: 

O x K oc Str (F^K) Str (F 2 £K) + Str {F 1 ^KF 2 ^K) . (C3) 

Here the pion fields are contained in £ = y/S = 
exp(z$/[2/]), while F\p are spurions which pick out the 
flavor and taste of the underlying quark operator. The 
argument for the equality of the coefficients of the two 
terms is identical to that for SU(3) case given in Ref. [LSj . 
Expanding the operator (IC3|) to quadratic order in pion 
fields, removing contributions which are common to the 
factor of f K in the denominator of Bk and thus cancel, 
and contracting pion fields with staggered ChPT propa- 
gators leads to Eq. (|42|) . 

Appendix D: Tables of am, and Bk 

In this appendix we present the most useful subset of 
our results for the "pion" masses and Bk- We give results 
only for the three ensembles, C3, Fl and SI, which we 
use to do our continuum extrapolation. We also quote 



only the degenerate pion masses, since oc m x + m y to 
very good approximation. Finally, for Bk we show the 
subset of our data used in the SU(2) fits. These arc the 
results that lead our central value for Bk ■ Table IXXXII 
reports the average pion masses obtained as described in 
the main text. Table IXXXIII reports the values of Bk 
obtained using 1-loop matching at the scale /i = 1/a. 



TABLE XXXI. am, (Goldstone taste, £ 5 ) for the C3, Fl and 
SI ensembles. 



quark comb. 


C3 


Fl 


SI 


i-i 


0.1342(2) 


0.0902(3) 


0.0596(2) 


2-2 


0.1861(2) 


0.1244(2) 


0.0819(2) 


3-3 


0.2258(2) 


0.1507(2) 


0.0992(1) 


4-4 


0.2593(2) 


0.1728(2) 


0.1139(1) 


5-5 


0.2888(2) 


0.1924(2) 


0.1269(1) 


6-6 


0.3155(2) 


0.2101(2) 


0.1387(1) 


7-7 


0.3402(2) 


0.2266(2) 


0.1496(1) 


8-8 


0.3634(2) 


0.2419(2) 


0.1599(1) 


9-9 


0.3852(2) 


0.2565(2) 


0.1695(1) 


10-10 


0.4060(2) 


0.2703(2) 


0.1788(1) 


TABLE XXXII. 


Results for Bk(1/cl) on the C3, Fl, and SI 


ensembles. 








quark comb. 


C3 


Fl 


SI 


1-1 


0.438(12) 


0.360(33) 


0.357(23) 


2-2 


0.4815(46) 


0.434(14) 


0.4147(73) 


3-3 


0.5132(27) 


0.4699(77) 


0.4455(41) 


4-4 


0.5363(19) 


0.4928(53) 


0.4689(27) 


5-5 


0.5547(15) 


0.5118(40) 


0.4878(21) 


6-6 


0.5701(12) 


0.5283(33) 


0.5036(17) 


7-7 


0.5835(10) 


0.5426(27) 


0.5173(14) 


8-8 


0.5954(9) 


0.5553(23) 


0.5294(13) 


9-9 


0.6061(8) 


0.5667(20) 


0.5403(12) 


10-10 


0.6159(7) 


0.5771(18) 


0.5502(11) 


1-8 


0.5735(38) 


0.5243(97) 


0.4908(58) 


2-8 


0.5674(21) 


0.5215(55) 


0.4915(30) 


3-8 


0.5690(16) 


0.5248(42) 


0.4978(22) 


4-8 


0.5732(13) 


0.5303(35) 


0.5045(18) 


5-8 


0.5784(11) 


0.5365(31) 


0.5111(16) 


1-9 


0.5811(38) 


0.5329(95) 


0.4992(57) 


2-9 


0.5759(20) 


0.5304(53) 


0.5000(29) 


3-9 


0.5772(15) 


0.5332(40) 


0.5058(21) 


4-9 


0.5809(12) 


0.5382(33) 


0.5119(17) 


5-9 


0.5855(11) 


0.5439(29) 


0.5180(15) 


1-10 


0.5879(38) 


0.5410(93) 


0.5070(56) 


2-10 


0.5835(20) 


0.5386(51) 


0.5078(29) 


3-10 


0.5848(14) 


0.5410(38) 


0.5132(20) 


4-10 


0.5881(12) 


0.5455(31) 


0.5188(17) 


5-10 


0.5922(10) 


0.5507(27) 


0.5245(15) 
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